FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGG T C TGAAGAC T AACAT T T TGT GAAG T T G TAAAACAGAAAAC C T G T TAGAftATGTGGT GG T 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
ff| T ATATAT GT TTGT TCAGAC CAT C CT T T CC TACCAAAT GCAGCCCAAAAT CC ATGGCAAAC AAGT C 

|l| TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
Wl ATCAGTT TTGCACAGT GGCAAT T TT GGGACTGATTTAGAACAGAAACTCCATTGGAACCGCGAGG 



G 



ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
T GGAT TAACC CT CT AT GACAC T GCACC T T GCCC TAT T AAC AAT GAACGAACACGGC TACT T T CC A 
GAGATAT T TGATGAAAGGATAAAAT AT TTCT G TAATGAT TATGAT TCTCAGGGAT TGGGGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GAT GAAT G C T GAT AAT CAGGAAAC AT GAAAGAAG C CAT T T GATAGAT TAT T C T AAAGGAT AT CAT 
C AAGAAGACTAT TAAAAACACC T AT GC C TAT AC T T T T T TAT C T CAGAAAAT AAAGTCAAAAGACT 
ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8,39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMI T T AAEWSMS FS FFG FFLTYI RDFQKI S LRVE ANLHGLTL YDTAPC P INNERTR 
1,l LLSRDI 

Cl Important features: 

Tl Type II transmembrane domain: 

J** amino acids 13-33 

Z 1 ■;. 

if% 

jij Other Transmembrane domains: 

* amino acids 54-73, 94-113, 160-180, 122-141 



N-myristoylation sites. 



amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE 3 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTGCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
y Si TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
Q CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
|3 CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
Tvi C TCAGGAAT GT GACTGACAC GGAT ATT CT AGCC CT GGAAC GGCGAC TGC T GCAAACCATGGATATGAT 

f(% CATAAGCAAAAAGAAAAGGAT GGC AAT GGCACG GAGAACAAT GTT CC AGAAGGGGGAAG TGCATAACA 

||| AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
s CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
W ATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTT 
J;; ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 

a I! 

T:; GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
:p4 G AAGT T T T G GT C C CAACACATT T C CT T CAT T C T TGT T GGAAT AAT CAT C GT CACAT C CAT CAGAGGAT 

ffj TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATT TTTCACCTTCATAGCATACT C CTTCC CCGT CAGGTGATACTATGACCATGAGTAGCAT CAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGT AG CT GAGCCAAACACGT AGGAT TTCC GT T TTAAGGTTCACAT GGAAAAGGTTATAGC T TT G 
C CT T GAGAT TGAC T CAT TAAAAT C AGAGACT G T AACAAAAAAAAAAAAAAAAAAAAAGGGC GGC C GC G 
ACT CTAGAGTCGACCT GCAGAAG C T TGGCCGC CAT GGCCCAACTT GT TTAT T GCAGCTTATAAT G 
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FIGURE A 



u 



%l '•} 



MS FL I DS S IM I T SQ I L FFGFGWL FFMRQL FKDYE I RQYWQV I FS VT FAFSC TMFEL 1 1 FE I LGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGATOCPYTYMSYFLROTTDTDILALERRLLQ 
TMDMI ISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDMjEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVSSVLL I RMSMPLE YRT 1 1 TE VLGELQFNFYHRWFDVI FLVSAL S S I L FL YLAHKQAPEKQMAP 

Important features : 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 



.!« 425-444 

o 

W N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURES 

AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGflAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAAT TATGGGGATCACCCT TGTGAGCAAAAAGGCGAACCAGCAGCTGAAT T TCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAG T T T GCAGCCTAT TGT TACAAC TCATC TGATAC T T GGACTAACTCGT GCAT T CCAGAA 
AT TATCACCACCAAAGATCCCATAT TCAACACTCAAACTGCAACACAAACAACAGAATTTAT TGT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
G AAAC TAG C AC CAT G T C T AC AGAAAC T G AAC C AT T T G T T GAAAAT AAAG C AG CAT T C AAG AAT G A 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGC T GGT C T TGGAT T T TGCTAT G T CAAAAGG TATGT GAAGGCC T TC CCT T T TACAAACAAGAAT 
C AGCAGAAGGAAAT GAT CGAAAC CAAAG TAGTAAAGGAGGAGAAG GC CAAT GATAGCAACCC T AA 
T GAGGAATCAAAGAAAAC TGATAAAAACCC AGAAGAG T CCAAGAG TC CAAGCAAAAC TAC CGTG C 
GAT GCCT GGAAGCT GAAGT T TAGAT GAGACAGAAAT GAGGAGACACACCTGAGGCTGG TTTCTTT 
CAT GC TCC T TACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
C CC T T GGT T CCTAAC T GGAATCAGC TCAGGAC T GCCAT TGGAC TAT GGAGT GCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGT T TAGCCCC T GAAAGCCATGGAGAT T C T CATAAC T TGAGACC TAATC T CT G T AAA 
GC TAAAAT AAAG AAATAGAACAAGGC TGAGGATACGACAGTACAC TGTC AGCAGGGACTGTAAAC 
A.CA.GACAGGGTCAAAGTGT TT TCTCTGAACACAT TGAGT T GGAATCACT GT TTAGAACACACACA 
C TT AC TTTTTCTGGTCTC TACCAC T GCT GAT AT T TT C T C T AGGAAAT AT AC T T T TACAAGTAACA 
AAAAT AAAAAC T C T T AT AAAT TTCTATTTTTATCT GAG T T AC AGAAAT GAT TAC T AAG G AAG AT T 
ACT CAG TAAT T T GT T T AAAAAGT AATAAAAT TCAACAAACAT T T GC T GAATAGC TAC TATAT GT C 
AAGTGCTGTGCAAGGTATTACACTCTGTAATTGAATATTATTCCTCAAAAAATTGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
T T T TAT T T T T G C T GAG AC TAAT C T TAT T CAT T T T C T C TAAT AT GG C AAC CAT TAT AAC C T TAAT T 
TAT TAT TAACAT AC C T AAGAAGT ACAT TGT TACCTC TATAT AC CAAAGCACAT T T TAAAAG T GC C 
AT T AAC AAAT G TAT CAC TAGC CCTCC T T TT TCCAAC AAGAAGGGAC T GAGAGATGCAGAAAT AT T 
TGTGACAAAAAATT AAAGCAT T TAGAAAAC TT 
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FIGURE 6 

MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKAS FETCSYGWVGDG FWI SRI SPNPKCGKNGVGVL I WKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDP1FNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I C VTEVFME T S TMS TE TE P FVENKAAFKNEAAG FGGVPTALLVLALL FFGAAAGLG FC YVK 
RYVKAFPFTNKNQQKEMIETKWKEEKANDSNPNEESKKTDKNPEESK3PSKTTVRCLEAEY 

Signal sequence : 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGGACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 
ACGAAGG T TGGAAATAATACCATCCAT G TGCACCGAGAAAT TCACAAGATAACCAACAACCAGAC 
tjm T GGACAAATGG T CT T T TCAGAGACAG T TATCACATC T GTGGGAGACGAAGAAGGCAGAAGGAGC C 

% ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
y GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
CP ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ly ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
Iff CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
|[| CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
U\ GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 

CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
% a CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
Jj; TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
III CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
C3 CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
ll| GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
{3 T T T G T TCTACAT GGC T TTGATAAT TGT T T GAGGGGAGGAGATGGAAACAAT G T GGAGTC T CC C T C 

f|l| TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 

CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
T AC AGGT TAACC TGCAGAAACAGTACT TAGGTAAT T GT AGGGC GAGGAT TATAAATGAAAT T TGC 
AAAATCAC TTAGCAG CAAC TGAAGACAAT TAT CAAC CACGTGGAGAAAATCAAACCGAGCAGGGC 
T G T G T G AAAC AT G G T T G T AAT AT G C GAC TGC G AAC AC T GAAC T C T AC G C C AC T C C AC AAAT GAT G 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GAT T G TAT AAGCATGC T T T CT T T GAGT T T TAAAT TAT GT ATAAACAT AAGT T GCAT T T AG AA 
ATCAAGCATAAATC AC TT CAAC T GCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSETOLANLPPSYHNETNT 

TVI TS VGDEEGRRSHEC 1 1 DEDCGPSMYCQFAS FQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHS LVYVCKPT FVG S RDQDGE I LLPREVPDE YEVGS FMEEVRQE LE DLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 



N-myristoylation site. 

amino acids 202-208, 217-223 



Amidation site. 

amino acids 140-144 
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FIGURE q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
y k GAGT ATAT TGAC CACAAGC T CAGAGG AG T GCAGAAC CC T GTAG CCCGC T GCAT C ATGTGCT GT T T 

O CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
W TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
f l . ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 

^ GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
10 ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
III GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
s GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
i% AGATTCTGGGCAAGAAGAACGAGGCGCCCCGGGACAACAAGAAGAGGAAGAAG TGA CAGCTCCGG 

y i 

CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
|l| CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
O TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 

AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GAT CGCGCCACT GCACTCCAACCTGGGTGACAGAC TCTGTCTC CAAAACAAAACAAACAAACAAA 
AAG AT T T TAT T AAAGAT AT T T T G T T AAC T C 
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FIGURE 10 



RTRGRTRGGCEKVP I NTS CNP TAHLVNS S C PGLMCVFQG YS SKGL I QRS VFNLQI YGVLGL FWT L 
NW VLALGQCVLAGAFAS F YWAFHKPQD IPTFPLISAFIRT LR YH T GS LAFGA1 1 L TL VQ I ARVI L 
E Y I DHKLRGVQNPVARC IMCC FKCCLWCLEKFI KFLNRNAYIMIAI YGKN FCVSAKNAEMLLMRN 
IVRWVLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 27 6-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
IZ ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
p% CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
m GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
Lil TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
Ml CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
^ CCAAGGT CCAGGACGCCCAGCCCAACTCGGGTCTGCT GC AGG CCT C G GT CAT C AC C C TC T ACAC CAT G 

fsg 

W TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

f *r. GGG CAAC G AGAC AGT T GTGG C AG G C C C C GAG G G C TAT GAG AC C C AGT GG T GGG AT GC CC C GAG CAT T G 

%s| 

iff TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 

V a 

Q AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 

ly GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 

M AC TTCTGCCTG GT GCT GGC C T CAC T GCAC GT CAT GAT GAC GC T CAC C AAC T GG TACAAGC CCG GT GAG 

ffi 

,JW ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCJGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCA.GGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLV5I IMLSPGVESQL 
YKLPWCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLI LVGLTVGAFYI PDGS FTNIWFYFGWGS FLFILIQLVLL I DFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVAMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVI TLYTMFVTWSALSS I PEQKCNPHLPTQLGNETWAGPEGYETQWWDAPS I 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
?<* S FFHFCLVLAS LHVMMTLTNWYKPGETRKMI S TWTAVWVK I CAS WAGLLLYLWTLVAPLLLRNRD 

FS 

O 

5 ; s Signal sequence: 

|f| amino acids 1-20 

^ Transmembrane domains: 

^ amino acids 40-58, 101-116, 134-150, 162-173, 206-223, 240-257, 

ill 272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
G G T T GG AAAAAG AC T C C T G T AAC CC T C C T C C AGG ATGA AC C AC C T G C CAGAAGACAT G GAG AAC G 
C T C T CACC GGGAGC CAGAGC TCC CAT GCTTCTCTGC GCAATAT CCAT T C CATCAACCCCACACAA 
CTCATGGCCAGGAT TGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCT GATGTCAGGAGGAC 
TTTCTGTTTGrTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGTGATGCAGTATGACTACTATTCTTCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
C T T GCC T GGAT T GAGACGTGGT TCCT GGATT T CAAAG T GT TACC TCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGT T T TAT T CCCCTCCTGAATCCGAAGCAGGAT CT GAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACTTTTAGAACT ATGAG TACTACTTTTGTTAAATGTGAAAAACCCTCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCT GTCGACAGTAAAGT T GAAATGGTGACGT C 
CACT GC T GGC T T TAT T GAACAGCT AATAAAGAT T TAT T TAT T G T AAT AC C T CAC AAACG T T GTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGAC T T AAT AT AT T GAAG T AAC AC T T T T T TAG T AAG C AAG AT AC CTTTTTATTT C AAT T CAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTTGCAAAAGGGGAAGAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTT 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGAC TT AAACCAT TCATAT CATGTT TCCT T TGCGT T CAGCC AATT TCAATTAAAAT GAA 
CTAAATTAAAAA 
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FIGURE 14 



'JSCS' 



ffl 



MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWI IELNVNGGIENTLEKEVMQYDYYSS YFDI FLLAVFRFKVLILAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 



fll 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATC TAA GCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGG 
T T T AGGAACAT GTTTTGCTTTTT T AAAAT AT AT AT AT T T AT AAGAGAT C C T T T C C CAT T T AT T C T 
GGGAAGATGTT T TT CAAACTCAGAGACAAGGAC TT T GGT T T T TGTAAGACAAACGATGATATGAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTAKT^ 
ALRLAGLGLQQL DEGL FS RLRNLHDLDVS DNQLERVPPVI RGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LLAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPG PGLQS PLHAKPY I 

Important features : 
Signal peptide: 

amino acids 1-2 3 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCAXGCG 
GGTCCGGATAGGGC'TGAC'GCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT GAAGAAGGCAGCCAGGATGAAT C C T TAGAT T C CAAGACTAC T T T GAC AT CAGATGAG T CAG T A 
AAGGACCATAC T ACTGCAGGC AGAGT AG T T GC T GG T CAAATAT T TCT TGAT TCAGAAGAATC T GA 
AT T AGAATCCT C TAT TC AAGAAGAGGAAGACAGC CT CAAGAGC CAAGAGGGG GAAAGT GT CACAG 
AAGAT AT CAG C T T T C TAG AG T C T C C AAAT C C AGAAAAC AAGGAC TAT GAAG AGC C AAAGAAAG T A 
jU;.- CGGAAACCAGCT TT GACCGCCAT TGAAGGCACAGCACATGGGGAGCCCT GCCAC T TCCCTT TTCT 

O TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 
y CTACAACC TAT GAC TAC AAAG CAGAT GAAAAG T GGGGCT T T T G T GAAAC T GAAGAAGAGGC T GCT 

Yl AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG 
Ifl CAATAAGAAZ^GCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CO C CAAAGCCCTG GAGAGAGTGT CATATGCT C T T T TAT T T GGTGAT TAC TT GC CACAGAATAT CCAG 

U! GCAGCGAGAGAGAT G T T T GAGAAGC T GAC TGAGGAAGGC T C T C C CAAGGGACAGACT GC T C T TGG 

f S2 CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
?S T T GGAGC TCT T GGGGGCAATC TAAT AGCCCACATGGT T T T GGTAAGTAGAC T T TAGT GGAAGGCT 

=3? g 

f"\ AAT AATAT T AACAT CAGAAGAAT T T G T GGT T TATAGCGGC CACAAC T T TT T CAGC TT TCAT GAT C 

|J CAGAT TTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 

C3 AAC ACAT GGAAT CT ACAT GTAAATGAAAGT TGGTGGAGTC CACAAT TT TTC TT TAAAAT GAT TAG 

m 

TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT T G T C AGAATC AT T T T T TAC AT TAGAT TATCAT AAT T T TAAAAAT T T T T CT T TAGT T TTTCA 
AAAT T TT GTAAATGG T GGC TATAGAAAAACAACATGAAATAT TATACAATAT TT TGCAACAATGC 
C CT AAGAAT T G T TAAAAT T CAT G GAG T TAT T T G T G CAG AAT GAC TCCAGAGAGC T C TAC T T T C T G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC T CAT T T T T AAT AAAAT TAT G T C T AAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDESGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEmYQTGMKILNGSI^KKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALVY YT FGALGGNL I AHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 



W' N-glycosylation sites . 

amino acids 195-199, 217-221, 272-276 

PI 

w 

Tyrosine kinase phosphorylation site. 

O amino acids 220-228 



N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 



Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 



Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTAT T ATACAAGAAAGGAG TGTACC TAT C AC ACACAG GG G GAAAAATGC T C TTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACAT T T T TAT CAC TGGATGTGAC TCGGGCT TT GGAAAC TTGGCAGCCAGAAC T T T T GAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
5*1 CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
fjj TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
Ifl TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
CP A.TGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 
m T CGCCAT T TGGGAGCAGCT GTC T CC AGACATCAAACAACAATAT GGAGAAGGTT ACAT T GAAAAA 

**h AG T C TAGACAAACT GAAAGGCAATAAATCC TATG T GAACAT GGAC C T C T CT CCGGT GGTAGAGT G 

If! CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
O TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
W GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
ifi AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCT C T TAT CT AAAT AT TAAAAG ATAAGT C AAC CC AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENWRTAQWVKNQVGSKGLWGLINNAGVPGVLAPTDWLTLEDY 
REP IEVNLFGLI S VTLNMLPLVKKAQGRVINVS SVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVWSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVM1D 
LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 



AppJD= 10063583 



Page 172 



FIGURE 21 



CTGAGGCGGCGGTAGC ATG GAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTATACAATTGA 
CAT TCAGAAATATAT T CCATGCT AT CAGCTT TT TAGCT T TTATAATTCT TCAGGCGAAG T AAATG 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGT CAT TCAGAT CAGATCATGACGT TTAGAGAGAGGCT GCT TCACAAAAACTTGCAGGAGCATT T 
T TCAAACCAAGACCT T G T T TT TCTGCT AT T AACACCAAG T ATAATAACAGAAAGC TGC TCTACT C 
ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T TT TAGCCGAGC AGTACAAACACAC AGCTC TAAATT TT T T GAAGAAGAT GGATCCT TAAAGGAGG 
TACATAAGATAAAT GAAATGTAT GCT T CAT TACAAGAGGAATTAAAGAGTATATGCAAAAAAGT G 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
AT G T C T T T AAAAAAT AG AC AT G T T T C T AAAAG TAG C T G T AAC T AC AAC C AC CAT C T C GAT G TAG T 
AGACAATC TGACCT TAATGGTAGAACACAC TGACAT TCCT GAAGCTAGTCCAGCTAGTACACCAC 
ty AAATCATTAAGCATAAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTA 
GATACACAAGACAAAC GATCTAAAGCAAATAC T GGTAG TAG TAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAAT TGAAAAGAT GAAGG GT T TTGG T GAATAT TCACGG T CT C 
CTACATT T TGA T CCT T TTAACC T TACAAGGAGATTTT TT TATTTGGCTGAT GGG TAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTT TGCAGTAATACACAGATAACTCTTAGTGCATT TACT TCACAAAGTACTTTT TCAAACATCA 
GAT GCT T T TAT T TCCAAACC T T T T T TT CACC T T T CAC T AAG T T GT TGAGGGGAAGGCT TACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGAT TGATTGAAGCTAGGAGT TAGAGACCAGCCTGGGCAACGT AT T 
GAGACCATGTCTATTAAAAAATAAAATGGAMAGCAAGAATAGCC T TAT T T TCAAAATAT GGAAA 
GAAAT T T ATATGAAAAT T TAT C T GAG T CAT T AAAAT T C TC C T T AAGT GAT ACT T T T T T AGAAGT A 
CAT TATGGC TAGAGT T GCCAGATAAAAT GC T GGATAT CAT GCAATAAAT T T G CAAAACAT CAT C T 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 



App_ID=10063583 



Page 173 of 320 



FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQL FS FYNS S GEVNEQALKK I LSNVKKNWGWYKFRRHSDQ IMT FRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWMTLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDI PEAS PAST PQI IKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMS SPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family, 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCC7\A 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAAC GAT GCCCCCCTC CACGAGAT C AAC GGT GATC ATT T GAAGAT CTGT CC CC AGGGTT CTAC CTGCT GCT CT 
f *\ CAAGAGAT GGAGGAGAAGT ACAGCCT GCAAAGT AAAGAT GATT T CAAAAGT GT GGT CAGC GAACAGTGCAAT CAT T TG 

ff* CAAGCTGTCTTTGCTT CACGTTACAAGAAGTTT GAT GAATT CTT CAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 

AATGATATGTTTGTGAAGACATATGGCCATTTATACATGCAAAATTCTGAGCTATTTAAAGATCTCTTCGTAGAGTTG 
AAACGTTACTACGTGGTGGGAAATGT GAACCTGGAAGAAAT GCT AAAT GACTTCTGGGCTCGCCTCCT GGAGCGGATG 
fp TTCCGCCTGGTGAACT CCCAGTACCACTTTACAGATGAGTATCT GGAATGT GTGAGCAAGTATACGGAGCAGCTGAAG 

h| CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 

T TAG C GGT T GC GGGAGATGT C GT GAGCAAGGTCT C CGT G GTAAAC CC CAC AGC C CAGT GTAC CCAT GC'C CT GT T GAAG 

•z 

AT GAT CTACTGCTCCCACT GCCGGGGTCTCGTGACT GT GAAGC CAT GT TACAACT ACT GCT CAAACAT CAT GAGAGGC 

W 

I fi TGTTT GGCCAACCAAGGGGATCT CGATTTT GAATGGAACAATTT CATAGAT GCT ATGCTGATGGTGGCAGAGAGGCTA 

GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
|*j AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
f" ; | TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
?! 8 AGT TT GGAC CGACT GGTTACT GATGT CAAGGAGAAACT GAAAC AG GC CAAGAAAT T CT GGTCCTCCCTTCCGAGCAAC 

GTT T GCAAC GAT GAGAGGATGGCTGCAG GAAAC GGC AATGAGGATGACT GT TGGAATGGGAAAGGCAAAAGCAG GTAC 
CT GTT T GCAGT GACAG GAAAT GGATT AGC CAAC CAGGG CAACAACCCAGAGGT CCAGGT TGACACCAGCAAACCAGAC 
ATACT GAT CCTTCGT CAAATCAT GGCTCTT CGAGTGATGACCAGCAAGAT GAAGAATGCATACAATGGGAACGACGTG 
GACTT CTT T GAT AT CAGT GAT GAAAGTAGT GGAGAAGGAAGTGGAAGT GGCTGT GAGTATCAGCAGTGCCCTTCAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGC CTAC CT CCT CACTGTCTT CT GCAT CT TGT TC CT GGTTAT GCAGAGAGAGT G GAGATAAT T CTCAAACTCT GAG 
AAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTACCATCCTAGTGACTTTGCTT 
T GGAC AAC AAT GTAC AGT T T TT ACTATGT GGC CAC T GGT T T AAGAAGT GCT GACTTT GT TT T C T CATT CAGTT TT GGG 
AGGAAAAGGGACTGT GCATT GAGT TGGTTCCTGCT C CCC C AAAC CAT GTT AAACGT GGCTAACAGT GT AGGTACAGAA 
CTATAGTTAGTT GT GCAT T T GTGATT TTAT CACT C T ATTAT TT GTTT GTAT GTT T TTTT CT CATTT CGTTT GT GG GTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CT GAAAT ATTAAAT AGC T GTAC AGAAGCAGGTTTT ATTT ATC AT GTT AT CTTAT TAAAAGAAAAAGCC CAAAAAGC 
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FIGURE 24 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFMKNDAPLHEINGDHLKICPQGST 
C CS QEMEEK YS LQSKDDFKS WSEQCNHLQAVFASRYKKFDE FFKEL LENAEKS LNDMFVKT YGH 
LYMQNSELFKDLFVELKRYYWGNWLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVS\ n /NPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVKISDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
hh EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
O KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 



Ifl Important features: 

PI Signal peptide: 

amino acids 1-22 



ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

GlycosaminoglYcan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 



Glypicans proteins. 

amino acids 54-75, 105-157, 233-280, 309-346, 423-460, 468-506 
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FIGURE 2S 



CTCGCCCT CAAAT G G GAAC GCTGGCCT G GGAC T AAAG C AT AGAC C AC C AG G C T GAG TAT C C T G AC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
T GC T GATGTCCATGG T C TC TAGC AG CCTGAATCCAGGGG T CGC CAGAGGCC ACAGGGACC GAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGC CCCGAGAAGAAAAT T CAT GACAG TGTCTGGGCT GCC AAAGAAGC AGTGCCCC T GT GAT CAT T 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCC T GCCAGCAAT T T CTCAAACAAT GT CAGC TAAGAAGC TTTGCTCTGCCTTTG TAG GAGC TC T G 
AGC GCC CAC T CT TC C AAT TAAAC AT T C T C AGC CAAGAAGAC AG TGAGCACACC T AC CAGACAC T C 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCT AGTG TC AT TT AACCT TAAATGCAATCAGGAAAG T AGCAAACAGAAG T CAATAAATAT T T T T 
AAATGTCAAAAAAAAA&AAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 



f:J N-myristoylation sites. 

O amino acids 27-33, 46-52 

m 

SB 

CO 



:: :: - .v 

w 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCC ATG GACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCA.CTTTGAGA 
1^ AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
Q GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
O CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 

^ TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 

III 

gagcccacctggaaacacattggggatggctgctgcctcaccagagagacgtggaaggatcttga 
gaacgcccagttctccgaaatccaaatggaacgacagccccctcccttgaagtggctacctgttg 

|sj GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
« TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 

GCAGAATGAGAGAAGACAT TC AT GT AC CAC C TAC TAG T CC C TC T C TC CCCAACC T C T GCC AGGGC 
JLj AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
f s $ AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
O CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
III CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 

CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 

AT GCCAGAGC AAGAC T C AAAGAGGC AGAGG TTTTGTTCT C AAAT AT T T T T T AAT AAATAGAC GAA 

ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYIiMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGAKFQFYPPGCRVTCLDPNPHFEKFLTKSMAEMRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRV^^ 

KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern ♦ 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 73-34, 30-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAfiAGGCTCATGC.TTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTT TAT GAT T C T TAT AC TAAT T T ATACAA&GATAT TAAGGC CC TGT TCAT T AAGAAAT T 

u _ GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 

g AGTTAAAAGCTGAAAAAAAAAAAA2AAAAAAAAAAAAAA 

%3 

i[i 
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FIGURE so 



MLLLTLLLLLLLLKGSCLSWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

H 

f*J Growth factor and cytokines receptors family: 

4=1 amino acids 3-18 

m 
w 
m 
m 
w 

£ 

ca- 
in 
o 
w 

13 
III 
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FIGURE 31 



G T T T G AAT T C C T T C AAC T A T AC C C ACAG T C C AAAAG C AG AC T C AC T G T G T C C CAG GC T AC CAG T T 
C CT CCAAGCAAGTCAT TT CCC T TAT T T AACCGATGT G TCC C T CAAACAC C T GAG T GCTAC TC CC T 
AT T T GC AT C T G T T T T GAT AAAT GAT G T T GACAC C C T C C AC C GAAT TC TAAGTGGAAT CATGT CGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT TCCGAGAAT CAT T G T C AT GT ACAT GCAAAAC GC ACT G AAAGAAC AGCAGCAT GGT GC AT T G T 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AAC CAGAAT GCATATAC TACAACT GCTAT TAATGGGACAGATT TCTGTACATCAGCAAAAGATGC 
AT T C AAAAT C T T G T C C AAG AAC T CAAG T CAC T T T AC AT C TAT T AAC T GC T T T G G AGAC T T C AT AA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GG AAACAAAT GAT GGAT CGT CAGAAAAGC CC TACT T TAT GGAT CAAGAAT T T C T GAG T T T CG T A 
AAAAGGAGCAACAAAT TAAACAAT GCAAGGGC AC AGCAG G ACAAG CAC T CAT T AAGGAAT GAGGA 
GGGAACAGAACTCCAGGCCATTGTGAG ATAGA TACCCAT T TAGGTAT CT GT ACC TGGAAAACAT T 
T CC T TC TAAGAGCCAT T T ACAGAAT AGAAGAT GAGACC AC T AGAGAAAAG T TAG T GAAT T T T T T T 
T TAAAAGACCTAATAAACCCTATTCT TCCTCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSIJ^ 
LDTERENMKCVLGFAIVSTGITAVLW^ 

I L I FF W VL WAVLL S LG T AGAAQVME GGQVE YKP LS G I R YMWS YHLIGLIWTSEFI LACQQMT I A 
GAWTCY FNRSKNDPPDHP ILS SLS I LFF YHQGTWKGS FL I SWR I PRI I VMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKIL3KNSSHFTSINCFGD 
FI I FLGKVLWC FTVFGGLMAFNYNRAFQVWAVP LLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
^ VDLETNDGSSEKPYFNIDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

£3: Important features: 

y-- Signal peptide: 

amino acids 1-2 0 

|.y Putative transmembrane domains: 

? amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

ij! N-glycosylation sites. 

| ; | amino acids 204-208, 295-299, 313-317 

6 

HI N-myristoylation sites. 

amino acids 147-153, 173-184, 196-202, 296-275, 342-348 
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FIGURE 3 3 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 
LJ C GAAAAGTCCAGC T G CCCAAGCAGGT GAGACCACAAAAGC C TAT CAGAGGC CACC TAT T CCAGGG 

J*f ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
O CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
ffl GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
hi AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
jm GGATGTCAGCCT GGGAC T T GT T C CAAAAGAAGAATT GAGC ACACAGT C T T T GGAGCCAG TAT CCC 

IS T GGGAGAT CCAAAC T GCAAAAT T GAC T TGT CG T T TT TAAT TGAT GGGAGCACCAGCAT T GG CAAA 

CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
itf C GGTCC AC TGAT GGGT GT TGT CCAGTATGGAGAC AACCC T GC TAC TCACT T T AAC C T CAAGACAC 

» ACACGAAT T C T C GAGATCT GAAGACAGCCATAGAGAAAAT TAC T CAGAGAGGAGGACT T TC TAAT 

^ GTAGG T CGG GC CAT CTCCTTTGT GAC C AAGAAC TTCTTTTC CAAAGC CAAT G GAAAC AGAAG CGG 

GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
Wl GAC T T GC GAGAGAGT C AGGAATC AACAT T T T C T TCATC AC CAT T GAAGGTGC T GCTGAAAAT GAG 

13 AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
yi GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
r% ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
2? GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTC CAAGCCCAACAAGAGGAAGT TAAT GAT CC T CAT CACC GAC GGGAGGT CC TACGAC GACG TC C 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
G T T T GACAACCT CC AT CAG TATGTC CC CAGGATC AT CCAGAACAT T T G T ACAGAGT TCAAC T CAC 
AGCCTCGGAACTGAATTCAGAGCAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
T GATCACAAACGTATAGAATGAGCC AAAAG GC TACATCAT G T TGAGGG T GC TGGAGAT TT TACAT 
T T T G ACAAT T G T T T T CAAAAT AAAT G T T CGGAAT AC AGT GCAGCC C T TACG ACAGGCT TACG TAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T T GTCAT GAC AAT GTAGGAAT TGC T GAATTAAAT G T T T AGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE .aa 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKI I DPE FIVKCPAG 
CQDPKYHVYGTDWASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
E S F I VLE S KPKKGVT YP S ALTYS S SKS PAAQAGE T TKAYQRPP I PGT TAQP VTLMQLLAVT VAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMG WQYGDNPATHFNLKTHTNSRDLKTAIEKI TQRGGLSNVGRAI S FVTKNFFSKANGN 
RSGAP^TVYVVMVDGWPTDKVEEASRIAR 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI LIT DGRS YDDVRI PAMAAHLKGVI TYAI GVAWAAQE E LE V I ATHPARDHS FF 
VDE FDNLHQ Y VPRI I QNI C TE FNS QPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-263, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 

CCGAGC AC AGGAGATT GC CTGCGT TT AGGAGGT GGC T GCGT TGT GGGAAAAGCT AT C AAGGAAGAAAT T GC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGACAAG AC TTTC AC TT C AC ACT T CGAGAGC ATTC AAAC TG CT CT CAT C AAAATCCATT TC T GGTC AT TC 
O TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
5*1 TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 
J I ATTGTCCTTAGAGGAT GAACACCT TCTT T ATGGTGACATAATCCGACAAGAT TTTTTAGACAC AT ATAAT A 

acctgaccttgaaaaccattatggcattcaggtgggtaactgagttttgccccaatgccaagtacgtaatg 

III 

aagacagacactgatgttttcatcaatactggcaatttagtgaagtatcttttaaacctaaaccactcaga 

iii; 

:! ; ji gaagtt tttcacaggt tatcctct aatt gataattattcctatagaggat ttt accaaaaaacccatattt 

cttaccaggagtatcctttcaaggtgttccctccatactgcagtgggttgggttatataatgtccagagat 
Q ttggtgccaaggatctatgaaatgatgggtcacgtaaaacccatcaagtttgaagatgtttatgtcgggat 
Ifl ctgtttgaatttattaaaagtgaacattcatattccagaagacacaaatcttttctttctatatagaatcc 
13 atttggatgtctgtcaactgagacgtgtgattgcagcccatggcttttcttccaaggagatcatcactttt 
%U tggcaggtcatgctaaggaacaccacatgccattattaacttcacattctacaaaaagcctagaaggacag 
W gataccttgtggaaagtgttaaataaagtaggtactgtggaaaattcatggggaggtcagtgtgctggctt 
3 59 acactgaactgaaact catgaaaaacccagactggagactggagggttacacttgtgatttattagtcagg 

cccttcaaagatgatatgtggaggaattaaatataaaggaattggaggtttttgctaaagaaattaatagg 
accaaacaatttggacatgtcattctgtagactagaatttcttaaaagggtgttactgagttataagctca 
ctaggctgtaaaaacaaaacaatgtagagttttatttattgaacaatgtagtcacttgaaggttttgtgta 
tatcttatgtggattaccaatttaaaaatatatgtagttctgtgtcaaaaaacttcttcactgaagttata 
ctgaacaaaattttacctgtttttggtcatttataaagtacttcaagatgttgcagtatttcacagttatt 
attatt t aaaattact tcaactttgtgt tttt aaatgt tttgacgatt tcaatacaagat aaaaaggatag 
tgaatcattctttacatgcaaacattttccagttacttaactgatcagtttattattgatacatcactcca 
ttaatgtaaagtcataggtcattattgcatatcagtaatctcttggactttgttaaatattttactgtggt 
aatatagagaagaattaaagcaagaaaatctgaaaa 
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FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLL^ 

TLRSHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKT IMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site, 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCrTCTCATC 
TCG T C CTT GCC AAGAGAGT AC AC AGT CAT TAATGAAGCCT GCCC T GGAGC AGAGTGGAAT ATCAT GTG TCGGGAGTGCT GTG 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCC AGGT TCTGC GAGC C CC AAAGGGTCAGAT T T TG T TGGAAAGCTATCCCC TAAAT GCTC ACT GTGAATGGACC ATTCAT GC 
TAAACCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCAGTATGACTATGTTGAG 
G TT C GTGAT GGAGAC AACCGC GATGGCCAGAT CAT CAAGCGTG TC TGTGGC AACGAGCGGCCAGCTCC TATC CAGAGCAT AG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
r| ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
f"l TATAC TGGGC AGCGC TGTGAAAAT CTCC T T GAAGAAAGAAAC TGC TC AGACC C TGGGGGC CCAGTCAATGGGTACCAGAAAA 

ffi TAACAGGGGGCC C TGGGC T TATC AACGGAC GC CAT GC TAAAAT T GGC AC CGT GGTGTCT T T CT TT TGTAACAAC TC C TATGT 

||j TCT TAGT G G CAATGAGAAAAGAAC T TGCC AGCAGAAT GGAGAGT GGTC AGGGAAAC AGCCC ATCTGCATAAAAGC CT GCCGA 

|f| GAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGATGCAGGTTCAGTCAAGGGAGACACCATTACACCAGCTAT 
if. AC T CAGCG G CC T T CAGC AAGC AGAAACTGCAGAGT GCCC C TACCAAGAAGCC AGCCC T TCCCTT TGGAGATC T GC CCAT GGG 

ATACCAACATC T GC AT AC C CAGC TC CAGT ATGAGTGC AT C TCACC CT TC TAC C GCCGCC TGGGC AGCAGCAG GAGGACATGT 
CTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCGGGAAAATTGAGAACATCACTGCTCCAAAGA 
CCCAAGG GT TGC GC TGGCC GT GGC AGGCAGCC ATC TACAGGAGGACC AGCGGGGTGCATGACGGC AGCC TACACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCACCAT GATCAAGAC AGCAGAC C TGAAAGTTGT TT TGGGGAAAT T C T ACCGGGAT GAT GACCGGGATGAGAAGAC CAT C C 
AGAGC CTACAGAT T T C T GC TATCAT TCTGCATCCC AACTATGAC C CC AT CC TGCTT GATGC TGACATCGCC AT CCTGAAGCT 
CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
TCC CACATC AC TGTGGC TGGC T GGAATGT CC TGGC AGAC GTGAG GAGCC C TGGC TT C AftGAACGACACACTGC GC TC TGGGG 
TGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGCAGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTT 
C TGTGCCAGC T GGGAAC CC AC T GCCCCT T CTGATATC TGCAC T G CAGAGAC AGGAGGCAT C GCGGCTGTGT CC TT CC CGGGA 
CGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATAAAACATGCAGCCACAGGCTCTCCA 
CTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATTGAAAGAAATATGAAATGAACCATGCTCATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCC AGGGC T T CT GAC TT C AGGGACAAAAC TCAGTGAAGGGT GAG T AGACC T CCAT TGC T GGTAGGC TGATGCCGC GT C C A 
CTAC TAGGACAGCCAAT TGGAAGATGCC AGGGC TT GCAAGAAGTAAG T T TC T TCAAAGAAGACC ATATACAAAAC C TC TCCA 
C TC C ACTGACC T GGT GGTC T TC C CCAAC T TTC AGT TATAC GAAT G CCAT CAGC T TGACC AGGGAAGATC TGGGC T T CAT GAG 
GCC CC TT T T GAGGC T C TCAAGT T C TAGAGAGCTGCC TGT GGGACAGC CC AGGGCAGCAGAGCTGGGATGTGGTGC AT GC C T T 
TGTGTACATGGC CACAGTAC AGT C TGGTC CT TT TC C T TCC CCATC TC T T GT ACACAT T TT AATAAAATAAGGGTTGGCT TC T 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAA^AAAAAAAAAAAAAAAAA 



m 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKA.GSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRT CQQNGEWS GKQPI C I KACRE PK I SDLVRRRVLPMQVQSRE TPLHQLYS AAFS KQKLQS APTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVS FPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Ami da ti on site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



m 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
T GAAGGTT CTCAGT CAAATT CT TTGT GAT CTACT GAT T GT GGGGG C AT GGCAAGGTT T GC TT AAAG GAGCT T GGCT GG 
TTTGGGCC CTT GTAGCTGACAGAAGGT GGC CAGGGAGAAT G CAGCAC ACT GCT CGGAGAATGAAGGC GCT T CT GTT GC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGC^ACCGAACTA 
GGGCACGGCCCTT TGAGAGAT CCACT ATT AGAAGCAGATC ATT TAAAAAAATAAAT C GAG C T TT GAGT GTT CT TCGAA 
GGACAAAGAGC GGGAGT GCAGT T GCC AACC AT GC CGACCAGGGCAG GGAAAATT CT GAAAACACCACT GCC C CTGAAG 
T CTTT CCAAGGTTGT ACCACCT GAT T CCAGAT GGTGAAAT T AC CAG CATC AAGAT CAAT C GAGTAGAT C CCAGTGAAA 
GCCTCTCTATTAGGCTGGT GGGAGGTAGCGAAACCCCACT GGTCCATATCATTATCCAACACATTTAT CGTGATGGGG 
TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC 
5 m ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
f~% GCAGGAACAAT GGACAGG CCCCGGAT G CCT ACAGACC CC GAGAT GACAGCT T T CAT GT GAT T CT CAACAAAAGTAGCC 

p ; % CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTT CAT CTT CAAT GTGCTGGATGGCG 

GT GTGGCAT ATC GACATG GTCAGCTT GAGGAGAAT GACC GT GT GT TAG CC AT CAATG GACAT GAT CTT CGAT ATGGCA 
f*% GCCCAGAAAGTGCGGCTCAT CT GATT CAGGCCAGT GAAAGACGT GTT CAC CTC GT CGT GT C CCGC CAGGTT C G GCAG C 

1 1| GGAGCCCT GACAT CT T TCAGGAAGC C GGCT GGAACAGCAAT GGCAGCT GGT CC CCAGGGC CAGGGGAGAGGAG CAACA 

f «5 CTCCCAAGCCCCTCCATCCTACAATTACTT GT CAT GAGAAGGT GGTAAATATCCAAAAAGACCCCGGT GAAT CTCT CG 

GC ATGACC GTC GCAGGGGGAGC ATCACATAGAGAAT GGGATTT GGCT ATCTATGT CAT CAGT GTT GAGCCC GGAG GAG 
T CATAAGCAGAGATGGAAGAAT AAAAACAGGTGACAT TTT GTT GAAT GTGGAT GGGGT CGAACT GACAGAGGT CAGCC 
GGAGT GAGGCAGTGG CATT AT T GAAAAGAACAT CAT C CTC GAT AGTAC TCAAAGCTT T GGAAGTC AAAGAGT ATGAGC 
C CCAGGAAGACT GCAGCAG C C C AGCAGC CCTGGACT CCAAC CACAACATGG CCC CAC C CAGT GACT GGT CC CCAT C CT 
GGGTCATGT GG CT GGAATTACCACGGT GCTT GT ATAACT GT AAAGATATT GTAT T AC GAAGAAACACAG CT GGAAGTC 
TGGGCTTCT GCATTGTAGGAGGTTAT GAAGAATACAATGGAAACAAACCTTTTTT CATCAAATCCATT GTT GAAGGAA 
CACCAGCAT ACAATGAT GGAAGAATT AGAT G TGGT GATAT T CT T CTTGCT GTCAATGGTAGAAGTACAT CAGGAAT GA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
T T TTATAGAAT CAAT GAT GGGT CAGAG GAAAACAGAAAAAT CACAAAT AG GCTAAGAAG TT GAAACACTAT AT TT AT C 
TTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGGAAAAGTATGATCATCTAATGAAAGCCAGTT 
ACACCTCAGAAAATAT GATTCCAAAAAAATTAAAACTACTAGTTTTTTTT CAGT GTGGAGGATTTCTCATTACTCTAC 
AACAT T GT TTAT ATT T TTT CTAT TCAATAAAAAGCC CTAAAACAAC T AAAATGAT T GAT TT GTATACC C CACT GAATT 
CAAGCT GAT T TAAAT T TAAAATT TG GTAT ATGCTGAAGTCT GC CAAGGGTACATT AT G GC CAT T TTTAAT T TACAG CT 
AAAAT ATT TT TT AAAATGCATTGCT GAGAAACGT T GCT TT CAT CAAACAAGAATAAATATTTTT CAGAAGTTAAA 
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FIGURE do 



MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKS GSAVANHADQGRENSENTTAPEVFPRLYHL I PDGE I TS IKINRVDPSESLS IRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDS FHVI LNKS S PEEQLG I KL VRKVDEPGVFI FNVLDGGVAYRHG 
QLEENDRVLAI NGHDLRYGS PE SAAHL I QAS ERRVHLWSRQVRQRS PDI FQEAGWNSNGS WS PG 
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
I KT GDI LLNVDGYE LTEVS RS EAVALLKRT S SSI VLKALE VKE YE PQEDC S S PAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features : 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE ai 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGAC CCTCAT TTT GGGGAAGGGAAAAACTC TGAC TAATGAAGCATC CACGAAGAAGGTAGAACT T 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAA.CCTTTACAAGTGTGAGGAG 
CAT CCCAAGCAT CTGGT GGT TGGC AGGAACAGC ACT GGGT ACAGG TT AC GT TACAGT GGATATT T TGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAA.GACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCT GAAGTGGGTAAAT ATAC AATGGTCTT CCAC ACTAGAGACAAAGGCAAT GAGGT GAAC GCAGAACG 
GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAGCAGAGCTCCT GGT GAT GTAGAGTATAAAACAGTTGTAACAAGACAGCT T TCTTAGTCATTTT GAT 
CAT GAGGGTTAAATATT GTAATAT GGATACTT GAAGGACT TTAT ATAAAAGGATGACTCAAAGGATAA 
AAT GAAC GCTATTTGAGGACTCT GGTT GAAGGAGAT T TAT TTAAATT TGAAGTAATATAT TATGGGAT 
AAAAGGC CACAGGAAATAAGACT GCTGAATGTC TGAGAGAACCAGAGTT GT T CTC GTCCAAGGT AGAA 
AGGTACGAAGATACAATACT GTTATTCAT TTAT CCT GT ACAAT CAT CTGT GAAGT GGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTTTAAAGAGTTTTTGTAAAATGATTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACAT 
AT T AAC T AATAAT AAAT AT G T CT AT CAAATAC C T C T GT AG T AAAAT G T GAAAAAG CAAAA 
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FIGURE 42 

MGF^LTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPPCAKEFiyiANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKK 

DLVPENDENLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINI TVDFWFGA 

Q Important features: 

Signal peptide: 

amino acids 1-27 

%? s 

f|| N-glycosylation sites, 

ly amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

pi amino acids 191-202 

.? s 2 

i%i ... 
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FIGURE A3 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 
CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 
ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 
GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 
ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 
TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

Q AAAAAAPJ^^AAAAAAAAAAAAAAAAAAAAA 

.XfSS. 

IS 

w 

i: f'ii 

•:;;>■ j; 

Ml 
5 3 I 



o 

III 
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FIGURE 44 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQI^LQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 



Important features: 
Signal peptide: 

amino acids 1-24 

**% cAMP- and cGMP-dependent protein kinase phosphorylation site. 

G amino acids 58-59 

m 

UJ 

m 
m 
iy 

C3 
If! 
O 
111 
O 

ru 



N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 



App_ID=l 0063583 



Page 196 of 320 



FIGURE A* 

GTGGCTTCAT T T CAGT GGC TGACT T CCAGAGAGCAATATGGCTGG T T CC CCAACAT GCC T CACCC 
T CATCTATATCCT T TGGCAGC TCACAGGGTCAGCAGCCTCTGGACCCGT GAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCT T CAAC ACAACC C C T C T T GTC ACCATAC AG CC AGAAGGGGGCAC TAT CATAG T GACCCAAA 
AT CG TAAT AGGGAGAGAGTAGAC T T CCCAGAT GGAGGC TAC T CCC T GAAGC T CAGCAAACTGAAG 
AAGAAT GAC T CAGGGATC TAG TAT GT GGGGATATAC AGCT CAT CACT CC AGCAGCCC TCC AC CC A 
U : GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
C3 ATAAGAATGGCACCT GT GT GACC AAT CT GACATGC T G CAT GGAAC AT GGGGAAG AGGAT G T GAT T 

O TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
?"! C TGGAGATGGGGAGAAAGTGATATGAC C T TCAT C T GCGT T GCCAGGAAC CC TGTCAGCAGAAAC T 

U ; T CT CAAGC CC CATC C T T GC CAGGAAG CT CTGT GAAGG T GC TGC T GAT GACCCAGAT T CCT CCAT G 

^ fi 

|p GTCCTCCTGTGTCTGCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 

W GTT TCTGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGT GGACATT TGTCGGG 

% AAACT CC TAACATAT GC CC CCAT T C T GGAGAGAACACAGAG TACGAC AC AATCCCTCACAC T AAT 

*« AGAACAATCC TAAAGGAAGATCCAGCAAATACGGT TTACTCCACTGT GGAAATACCGMAAAGAT 

in 

f»J G G AAAAT CCC CACT CAC T GC T CACG AT G C C AG AC AC AC CAAGG C TAT TTGCCTAT GAG AAT G T T A 

|lj TC TAGA CAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 4.6 

MAGS PTCLTLIYI LWQL TGS AASGPVKELVGSVGGAVT FPLKSKVKQVDS I VWTFNT TPLVT I QP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGI.YSSSLQQPSTQEYYLHVYEHLSK 
PKYTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 

PRLFAYENVI 

f*% Important features: 

Q Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

J si amino acids 224-250 

W Leucine zipper pattern, 

%! t amino acids 229-251 

£ji N-glycosylation sites. 

HI amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 

291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGT CT GATGGCCATTCCAGCAACAACAAT GT CCT TGACAGCAAGAAAAAGAGCGT G 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AAC AGT AAT GCCAAT T G TGAAT T T T CAT T GAAAAAC AT CAG TG AC AT T CAT CCAGAAT CCT T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATG GC GAGT GG C T GGAGAG CAT C TAGT T T CC AC T T CGAT TC T GAAG AAAACAAAC AT AGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T G TAG T T TAAT GGGAATAAAAT GTAAGTATCAGTAGT T T GAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFL S S FFS VI TVIGALYCML I S I QALLKGPLMCNS PSNS NANCE FSL 
KNI S DIHPES FNLQW FFND S CAPPT G FNKPT SNDTMAS GWRAS S FHFDSEENKHRL I HFS VFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGC AG T GAGGAAC C T GCAGAG T CAC ACAG T T GCT GACC AAT TGAGC T GT GAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
y ; C AGCCAGAAGCAGC ACAGTCC T GTACC T GAGAAGGC CATC CCAC TCAT CAC T CCAGGCTC TGCC A 

Q CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
O TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 

m 

;; TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 



C 

2 - :; 

Q 
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FIGURE so 

MERVTLALLLLAGLTALE^DPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 3 6-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE 51 

GTGGAC TC TGAGAAGC CCAGGCAGT TGAGGACAGGAGAGAGAAGGCTGC AGACCCAGAGGGAGGG 
AGGACAGGGAG T CGGAAGGAGGAGGACAGAGGAGGGCACAGAGAC GC AGAGCAAGGGCGGCAAGG 
AGG AGACCC TGGT GGGAGGAAGAC AC T C T GGAGAGAGAGGGGGC T GGGC AGAGATGAAG T T C CAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
C GGAGAGGAAAGCACT GGGACAAAT ATTGGGGAGGC CC T T GGACATGGC C T GGGAGACGC CCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
T TGGCAGACAGGCAG AAGATGTC AT T CG AC AC GGAGCAGATGC TGTCCGCGGCTCCT GGC AGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGT GGAGTCAATACTGTGAACTCTGAGACG TC TCCT GGGATGTT TAACT T T GACACT T TCTGGAA 
GAAT T T T AAATCCAAGCTGGGT T TCATCAACTGGGATGCC ATAAACAAGGACCAGAGAAGCT CT C 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAiWVAAJ^jy\AAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *2 



I4KFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGT GVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVI RHGADAVRG S WQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
3SGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTWSETSPGM 

s , FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

I'll Signal peptide: 

y\ amino acids 1-21 

i'fl N-glycosylation site. 

is I amino acids 265-269 

C3 Glycosaminoglycan attachment site. 

W1 amino acids 235-239 r 237-241, 244-248, 255-259, 324-328, 388-392 

o 

y.s.<f; Casein kinase II phosphorylation site. 

m amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 243-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence . 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGG ATG TCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTGCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATAT AACGATGT TCAACAAGAGTGCAAACATCATGCT TGACAAGTGGCAGCACCTGG 
C CT CAGAGGGCAGC AGT CGT C T GGACATGTT TGAGCAC AT C AGC C TC AT GACC T TGGACAG T CT A 
CAGAAATGCATCTTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT 
CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
ATTACCTCTCCCAT GACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAA.GACAA 
AGC CAAGTCCAAGACT T T GGAT T TCAT T GATG T GCT TCT GCTGAGCAAGGAT GAAGAT GG GAAGG 
CAT T G T C AGAT GAG GAT AT AAGAG C AG AG G C T GACAC C T T CAT G T T T G GAG G C CAT GACAC C AC G 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGG TGCAAGAGC T T CTGAAGGACCGCGATCC TAAAGAGATTGAAT G GGACGACC T GGCCC 
AGC TGC CC T T CCT GAC C ATGT GC G T GAAGGAGAGCC TGAGG T TACAT CCCC CAGC T CC CT TCAT C 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
C CT CAT CGATAT TATAGGGGTCCATCACAACCCAAC T GT GTGGCCGGAT CC TGAGGT C T AC GAC C 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE Zd. 

MSLLSLPWLGLRPVi^SPWLLLLLWGSWLLARILAWTYAFYNNCRRLQC FPQPPKRNWFWGHLG 
LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLD^QHLASSGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHD FTDAVI RERRRTLPTQG I DDFFKDKAKS KTLD FI DVLLLSKDEDGKAL SDED I RAEADT F 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 

f«* LHPPAPFISRCCTQDIVLPDGRYIPKGITCLIDIIGVHHNPTWPDPEVYDPFRFDPENSKGRSP 

O LAFIPFSAGPRNCIGQAFAMAEMKWLAIJ^^ 

::: vglq 

\ 58 

Ifl Important features: 

CP Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

5 S; Cytochrome P450 cysteine heme-iron ligand signature. 

Q amino acids 4 61-471 

y N-glycosylation sites. 

= amino acids 112-116, 168-172 
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FIGURE 5 5 



AT CGCAT CA&TT GGGAGTACCAT C T TCCT CATGGGACC AGTGAAACAGCT GAA.GCGARTGT T TGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TT TGGTGGCATAACAA.GGGACT T GCACTTATC T TCT GCAT TT TGCAGTCT T TGGCAT TGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGT TT TATGAAGCTT TGGAAGGCACTATGGACAGAAGCTGGTGGACAGTT TT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATT TGAGGGTTACT T T TGGAAGCAACAAT ACAT TCTCGAACCTGAATGT CAGT A 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T CT GGATGT T G T CC CAC T GAAT T CC CAT GAAT ACAAAC C T AT T CAGC AACAGC AAA?\AAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *6 

MGPVKQLKRMFEPTRL I AT IMVLLCFALTLCSAFWWHNKGLAL I FC I LQSLALTWYS LS FI PFAR 
DAVKKC FAVC LA 



Important features: 
Signal peptide: 

amino acids 1-33 

|^ Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 

i!l 
in 

w 

CI 

if! 

C3 

w 
o 
flf 
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FIGURE S7 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
AT T TT CAAGCT C AGT GTCTTCAT CCCCTCCCAGGAATTCTCCACCTACCGCCAGT GGAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGACCTT GATGG GCAGCT AGACTTT GAAGAAT T TGTCCAT TAT CTCCAAGAT CATGAGAAGAAGCT GAGG 
CT GGT GTT TAAGATT T TGGACAAAAAGAAT GAT GGAC GCAT TGACGC GCAGGAGAT CATGC AGT C CCT GCGGGAC TTG 
GGAGT CAAGATAT CT GAACAGC AG GC AGAAAAAATT CTCAAGAGCAT GGATAAAAAC GGCAC GAT GAC CAT C GAC T GG 
AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 
AT CTT T GAT GT GGGT GAGAATCT AACGGT C CC GGAT GAGT T CACAGTGGAGGAGAGGCAGACGGGGAT GT GGT GGAGA 
CACCT GGT GGCAGGAGGT GG GGCAGGGG CC GTAT CCAGAACCT GCACGGCCCCCCTGGACAGGCT CAAGGT GCTCATG 
CAGGT CCAT GC CTC C C GCAG CAACAACAT GG GCAT CGTT GGTGGCTTCACT CAGATGATTCGAGAAG GAGG GGC CAGG 
|<& TCACTCTGGCGGGGCAATGGCATCAACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAG 
ft*: ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
Vf AT C GCCCAGAGCAGC AT CT AC C CAAT GGAG GTC CT GAAGAC CCGGAT GGC GCT GC GGAAGAC AGGC CAGTACT CAGGA 

0 AT G CT G GACT GCGC CAGGAGGAT CCT GGCCAGAGAGGG GGT GGC C GCCTT CTACAAAGGCTAT GT CCC CAACAT GCT G 
fn; GGCAT CAT CCC CTAT GCC GG CAT CGACC TT GCAGT CT ACGAGAC G CT CAAGAAT GCCT GGCT GCAGCACTAT G CAGT G 
fl AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
i|| CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
Iff TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGT CATC CCA 
^ * GCT GTGAGCATCAGCTACGT GGT CTACGAGAACCTGAAGATCACCCTGGGCGT GCAGT CGCGGTGACGGGGGGAGGGC 
|0 CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 

1 s S AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 

GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
S CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
jf ss H TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 

CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
If I ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 

TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
III GGCCT GGAC CCTGT CAGGAT GGGCC C CACCT CAGAACCAAACT C ACT GTC C CC AC TGT G GC AT GAGGGC AGTGGAGCA 

f**j CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
Jf; ACTGT T GGGAAAAGG GTTTT GT CCAGAAGGACAAGC CGGACAAAT GAGCGACTT C TGT G CTT CCAGAGGAAGACGAGG 

I'll GAGC^GGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 

GGGAC CAGC CC CACAT TC CAC TT GT GTCACT GC TTGGAACCTAT T TAT TTT GTAT TT AT TT GAACAGAGTTAT GT C CT 
AACTAT TT T TAT AGAT TT GT T T AAT T AAT AGCT T GT C ATT TTCAAGTT CAT TTT T TAT T CATAT TTAT GT T CAT G GTT 
GAT T GTACCTT CCCAAGCC CGCC CAGTG GGATGGGAGGAGGAGGAGAAGGGGGGCCTT GGGCCGCT GCAGT CACAT CT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCT GCT GCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATT CCACCAGAAT GACCT GA 
TGAGGAAAT CTTCAATAGGATGCAAAGAT CAAT GCAAAAAT TGT TAT ATAT GAACATATAACT GGAGT CGT CAAAAAG 
C AAAT TAAGAAAGAATTG GAC GT TAGAAGT T GT CAT TTAAAGCAGC CTTCT AATAAAGTT GTT T CAAAGCT GAAAAAA 
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FIGURE *8 



eg 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRD YHLLHPVEN I PE 1 1 LYWKHS T I FDVGENL TVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVliMQVHASRSNNMG I VGGFTQMI REGGARSLWRGNGINVLK IAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LARE GVAAFYKG YVPNMLG 1 1 P YAG I DLAVYE T LKNAWLQH YAVNS ADPGVFVLL AC G TMS S T C 
GQLAS Y PLALVRTRMQAQAS I EGAPEVTMS SLFKH I LRTE GAFGLYRGLAPNFMKVI PAVS ISYV 
VYENLKITLGVQSR 

Important features: 



j^jj Signal peptide: 



amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites . 

amino acids 129-133, 169-173 



Elongation Factor-hand calcium-binding protein . 

amino acids 54-73, 85-104, 121-140 
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FIGURE 5Q 



GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTGTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGGCTCAGCT 
G GGAACAT T GGGGAG GATGGAAT CCT GAGC TGCACT T T TGAAC C TGACAT CAAACT T TCTGATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGT T CAGAGGCCGGACAGCAGTGTT TGCTGATCAAGTGATAGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCACT T C TAAAGGC AAGGGGAAT G C TAACC T TGAG T ATAAAAC T GGAGC CT TCAGCAT GCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATACTCCTGTATGATTGAAAATGACATTGCCAAAGCAACAGGGGATATCAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 
GTGCGTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 
CACCAGATATGACCTAGTTTTATAT T T CT GGGAGGAAATGAAT TCATATC TAGAAGTCTGGAGTG 
AGC AAACAAGAGCAAGAAACAAAAAG AAGC CAAAAGCAGAAGGC T CCAATAT GAACAAGATAAAT 
CTAT C T TCAAAGAC AT AT TAGAAG T T GGGAAAAT AAT T CATGTGAACTAGACAAGT GT G T TAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T C T TC CCAAC T GACAAAT GC C AAAGT T GAGAAAAAT G ATC ATAAT T T T AGC ATAAACAGAGC AGT 
C GGGGACACCGAT TT T ATAAATAAACT GAGCACC T T C T T T T TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



m 

-KK'Sj 

id 



MASLGQILFWSIISII IILAGAIALI IGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YI I TSKGKGNANLE YKTGAFSMPEVNV.DYNASSETLRCEAPRWFPQPTVWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTI^ 
CVSS FFAI SWALLPLSPYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



T GACGTCAGAAT CACC ATG GCCAGC TAT CC T T ACCGGCAGGGC TGCC CAGGAGC T GCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
7\AATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCAT GACAGC T T CT CGGAT GC TATGAC CCAAC CATC TGTGGAGAG TGGAGT GCACCAGGGAC C T T 
T CC T GGC T TC T T AGAGTGAGAGAAGT ATGTGGACAT CTCTTCTTTTCCTGT CCC TC TAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCAT GAGAGAGATCT GT CT ATGGGACCAGT GGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T TGGGGCCAAAAGT CCAGT GAAAT T G T AAG C T TC AATAAAAGG AT GAAAC T C T GA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
S G Y I SMKE LKQALVNCNWS S FNDE T CLMM I NMFDKT KSGRI DVYG FS ALWKF IQQWKNL FQQ YDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6s 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTGTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGG TAT CAAT AATCAACCCC C AGGT AC CAG C T GG GGAAATAT TAAT CGG TAT C CAGGAGG 
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCTCATCTCCAATAAiyYTAAAAGCACTTA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 6d 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP. 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 



IP 11 Casein kinase II phosphorylation sites. 

v ft. amino acids 56-59, 155-158 



N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6* 



CI 
o 
m 
ill 
II! 

m 
w 

o 
If! 

c:i 

a 
rii' 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 



amino acids 37-41 



j y j N-glycosylation site. 

o 

Cfi 
W 

fjf N-rnyristoylation sites. 

CO amino acids 15-21, 19-25, 60-66 

w 

53 
If! 
O 
lil 

0 
ni 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCGCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATC TAG GTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTC TGCCCCTGAGGTCAAGAGAGGATGGGGCTAT T CAC T T T TAT AT AT TTATATAAAAT TA.G 
TAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 



MANPGLGLLLALGLPFLLARWGRAWGQI QTTSANENST VLPS STS S SS DGNLRPEAI TAI I WFS 
LLAALLLAVGLALLTOKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 



f*9 

%sl Transmembrane domain: 

^:!? amino acids 56-80 



N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids S6-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 

GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCTGTGCCTGCTGCACCAG 
TCAAATACTTCCTTCATTAAGCTGAATAATAAT GGCTTTGAAGATATT GTCATTGTTATAGATCCTAGT GTGCCAGAA 
GATGAAAAAATAATTGAACAAATAGAGGATATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGA 
TTTTTTTT CAAAAATGTATCTATATTAATTCCT GAGAATTGGAAGGAAAAT CCTCAGTACAAAAGGCCAAAACAT GAA 
AACCATAAACATGCT GATGTTATAGT TGCAC CAC CTACAC T CC CAGGTAGAGAT GAAC CAT ACAC CAAGCAGT T C ACA 
GAAT GT GGAGAGAAAGGC GAAT ACATT C ACT TC ACC C CTGACCT T CTACTT GGAAAAAAAC AAAATGAATAT GGAC CA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
T ACC GT GCT AAGT CAAAAAAAAT CGAAGCAACAAGGTGTT CCGCAGGTAT CTCT GGT AGAAATAGAGT T TAT AAGT GT 
CAAGGAGGCAGCTGT CTTAGTAGAGCATGCAGAATT GATT CTACAACAAAACTGTAT GGAAAAGATTGTCAATTCTTT 
CCT GATAAAG TACAAACAGAAAAAGCAT CCATAATGTT TAT GCAAAGTATT GATTCT GT TGTT GAATTT T GTAAC GAA 
AAAAC CCAT AAT CAAGAAGCT C C AAGCCTACAAAAC ATAAAGT G CAATT TT AGAAGT ACAT GGGAGGT GATTAGCAAT 
T CTGAGGAT TT TAAAAACACCAT AC C CAT GGTGACACCAC CTC CT CCAC CT GT CTT CT CAT T GCT GAAGAT CAGT C AA 
AGAAT T GT GTGCTTAGTT CTT GATAAGT CT GGAAGC AT GGG GGGTAAGGAC CGC CTAAAT C GAAT GAAT CAAG CAGCA 
AAACAT TT CCT GCTGCAGACTGT T GAAAAT GGAT CCT G GGT GGGGAT G GT T CACTTT GATAGTACT GCCACTATT GTA 
AATAAG CTAAT CCAAATAAAAAGCAGTGAT GAAAGAAACACAC T CAT GG CAGGATTACCTACAT AT CCT CT GGGAG GA 
ACT T C CAT CTG CT CT GGAATTAAATATGC AT TT CAGGT GATTGGAGAGCTACAT T CCCAAC T CGAT GGATCCGAAGTA 
CT GCT G CT GACTGAT GGGGAGGATAACACT G CAAGT T CTT GTAT TGAT GAAGT GAAACAAAGTGGGGC C AT TGTT CAT 
T TTAT T GCT TT GGGAAGAGCT GCT GATGAAG CAGTAATAGAGAT GAGC AAGATAACAGGAG GAAGT CAT T T TTAT GTT 
T CAGAT GAAGCT CAGAACAAT GGCCT CATT GATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
TCCCTTCAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAATTGATAGTACA 
GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
AT AAT G GAAAAT TTCACAGTGGATGCAACTTCCIAAAAT GGCCT AT CT CAGT ATT CCAGGA^CTGCAAAGGTGGGC ACT 
TGGGCATACAAT CTTCAAGCCAAAGCGAACCCAGAAACATTAACTATTACAGTAACTT CTCGAGCAGCAAATTCT T CT 
GTGCCTCCAATCACAGTGAATGCTAAAAT GAATAAGGACGTAAACAGTTT C CCCAGCCCAATGATTGTTTACGCAGAA 
ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT 
T T GGAACT TT T GGAT AAT GGTGCAG GCGCT GAT TCT T TCAAGAATGAT G GAGT CT ACT CCAGGT AT TT TACAG CAT AT 
ACAGAAAATGGCAGATATAGCTTAAAAGTTCGGGCT CATGGAGGAGCAAACACT GCCAGGCTAAAATTACGGCCT CCA 
C T GAATAGAGC CG CGT ACATAC C AGG CT GG GTAGT GAACGGGGAAAT T GAAGCAAACC CGC CAAGACC T GAAATT GAT 
GAGGATACT CAGAC CACCTT GGAGGATT T CAGC C GAACAGC AT C CGGAGGT GCATTT GT GGT AT CACAAGT CC CAAGC 
CTTCCCTTGCCT GACCAATACCCACCAAGT CAAATCACAGACCTTGAT GCCACAGTTCATGAGGATAAGATTATT CTT 
ACATG GACAGCACCAGGAGATAATTT TGAT GTT GGAAAAGT TC AAC GT TATATCATAAGAATAAGT GCAAGTATT CTT 
GAT CTAAGAGACAGT T T T GAT GATGCTCTT CAAGTAAATAC TACT GAT CT GTCAC CAAAGGAGGC CAACTCCAAGGAA 
AGCTTTGCATTTA7\ACCAGAAAATATCTCAGAAGAAAATGC7\ACCCACATATTTATTGCCATTAAAAGTATAGATAAA 
AG CAAT TT GACAT CAAAAGTAT C CAACATT GCACAAGTAACT T T GTTTAT C CCT CAAGCAAAT C CT GATGAC ATT GAT 
CCTAC AC CT AC T CCT ACT CCTACTC CTACT CCT GAT AAAAGT CATAAT TCT G GAGTT AATAT TT CTACGCT GGTAT T G 
TCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCATTTGAACCTTAACGAAGAAAAAAATCTTC 
AAGTAGAC CTAGAAGAGAGTTT TAAAAAACAAAACAATGTAAGTAAAGGAT ATTT CT GAAT CTT AAAATT CAT CC CAT 
GT GTGATCATAAACTCATAAAAATAATTTTAAGATGT CGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATA 
T GTAAAAACT GT CAAGATTAAAATTT AAT AG T T T CAT T TATTT GTTAT TT T ATTT GT AAGAAAT AGT GATGAACAAAG 
AT CCTTTTTCATACT GATAC CT GGTT GTAT ATT ATTT GAT GCAACAGT TT T CTGAAAT GAT ATTT CAAAT T GCAT CAA 
GAAAT TAAAAT CATCT ATCT GAGTAGT CAAAATACAAGTAAAGGAGAGCAAATAAACAACA.T T T GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANS SVPP ITVNAKMNKDVNS FPS PMI VYAE I LQGYVPVLGANVTAFI ESQNGHTEVLELLDNGA 
GADS FKNDGVYS RYFT AYTENGRYS LKVRAHGGANTARLKLRP PLNRAAY I PGWWNGE I EANP P 
RPEIDEDTQTXLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 

KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site, 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGA.CAGCAAAGACC6GGAAAGACCATACGTCCCCGGGCAGGGGTGA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAGCT GCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACT CGTT CTAGAAGGAAATGGATGCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCT GGTGCTCCTCT GCTGTGCTATCTCTGTCCTGTACAT GTTGGCCTGCACCCCAAfiAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
** b CGCAACTACGTGAGCAGCCTGAAGCGGCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG 
?*% AATGGGCAGTACCAAGCCAGCGATGCTGCTGGGCTGGGTCTGGACA.GGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
ZZ CT GGC C TT C CTG CACT CGC AGGT GGACAAGG CAGAGGT GAAT GCTGGC GT CAAGCTGG CCACAGAGTAT GCAGCAGTG 

CCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCGAGGAGAAGCCTGTG 
ff| AG GAAGGACAAGC GGGATGAGT T GGT GGAAGCC ATT GAAT CAGC CTT GGAGAC C CT GAACAAT CCT GC AGAGAACAGC 

l I C C CAATCACCGT CCT T AC ACG GC CTCTGATT T CATAGAAGG GAT CTAC C GAACAGAAAGGGACAAAGGGACAT TGT AT 

%M GAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAA 
|fl GT GAA&AAT GAAAAGCT CAAC AT GGC CAACACGCTTATCAATGT TAT C GT GC C T CTAGCAAAAAGGGT GGACAAGT TC 

CGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 
*^ AAAGAAGAAATAAAT GAAGTCAAAGGAAT ACTT GAAAACACTT C CAAAG CT GC CAACT TC AG GAACTT T AC CT TC ATC 

| § | CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 

TTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 
S ^ GTATT T TAT CCAGTT CTTT T CAGT GAGTACAAT C CT GGCATAAT ATAC GGC CAC CAT GATGCAGT CCCT CC CT TGGAA 

Q CAGCAGCT GGTCATAAAGAAGGAAACTGGATTT T GGAGAGACT T TGGATT T GGGAT GAC GT GTCAGTAT CGGT CAGAC 

s %g TTCAT CAATATAGGT GGGTTTGATCTGGACATCAAAGGCT GGGGCGGAGAGGAT GTGCACCTTTATCGCAAGTATCTC 

%H CACAG CAAC CTCATAGTGGTACGGAC GCCT GTG C GAGGACT CT T CCAC CTC TGG CAT GAGAAGCGCTGCAT GGACGAG 

0 CTGACCCCCGAGCAGTACAAGATGTGCATGCAGTCCAAGGCCATGAACGAGGCATCCCACGGCCAGCTGGGCATGCTG 

1 . 3 GT G TT CAGG CACGAGATAGAG GCT CACCTT CGCAAACAGAAACAGAAGACAAGT AGCAAAAAAACATGAACT CCCAGA 
: yi GAAGGATT GTGGGAGACACTTTTTCTTTCCTTTT GCAATTACT GAAAGTGGCTGCAACAGAGAAAAGACTT CCATAAA 
ill ' GGACGACAAAAGAATT GGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCT CTCT GTT GGGCTTTTTACAACAGA 
}pi AATCAAAAT CTCCGCTTTGCCT GCAAAAGTAACCCAGTTGCACCCTGTGAAGTGT CT GACAAAGGCAGAATGCTT GTG 
1 AGATTATAAGCCTAAT GGT GT GGAGGTTT T GAT GGT GTTT ACAATACACT GAGACCT GT TGTT T T GTGT GCTCATTGA 

AATAT T CAT GATTTAAGAGCAGTTTT GTAAAAAATT CATTAGCAT GAAAGGCAAGCAT ATT T CTC CTCATATGAAT GA 
GCCTATCAGCAGGGCT CTAGTTT CTAGGAAT GCT AAAATAT CAGAAGGCAGGAGAGGAGAT AG GCT TAT TAT GAT ACT 
AGT GAGTACATT AAGTAAAATAAAAT GGACCAGAAAAGAAAAGAAACCATAAATATCGTGT CATATTTT CCCCAAGAT 
T AAC CAAAAATAAT CT GCT TAT CT TT TT GGT TGT C CTT TT AACT GT CT C CGTTT T T TT CT T TTAT TTAAAAAT GCACT 
TTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC 
ATTTTTATATTTTTTAAGAAGATACTTTGAGAT GCATTAT GAGAACTTTCAGTT CAAAGCATCAAATTGATGCCATAT 
CCAAGGACATGCCAAATGCTGATTCT GTCAGGCACT GAATGTCAGGCATT GAGACATAGGGAAGGAAT GGTTTGTACT 
AATACAGACGTACAGATACTTT CTCT GAAGAGT ATT TT CGAAGAG GAGCAACT GAACACT GGAGGAAAAGAAAAT GAC 
ACTTT CTGCTTTACAGAAAAGGAAACTCATTCAGACTGGT GATATCGT GAT GTACCTAAAAGTCAGAAACCACATTTT 
CT CCT CAGAAGT AGG GACC GCT TTCT TACCTGT TTAAATAAAC CAAAGTAT AC CGTGT GAACCAAACAATCT CTT T TC 
AAAACAGGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATGGAGAAAAATATATATATATATATATATATTGT 
GAAAGATCAATCCAT CTGC CAGAATCTAGT GGGATGGAAGTTTTTGCTACATGTTAT CCACCCCAGGCCAGGTGGAAG 
TAACTGAATTATTTTTTAAATTAAGCAGTTCTACTCAATCACCAAGATGCTTCTGAAAATTGCATTTTATTACCATTT 
CAAACTAT TT TT TAAAAATAAAT ACAGTT AACATAGAGTGGTT T CTT CAT T CAT GT GAAAATTATTAG CCAG CAC CAG 
AT GCAT GAGCTAATT AT CT CTT T GAGT C CTT GCTT CT G TT T GCT CAC AGT AAACT CATTGT T TAAAAGCT T CAAGAAC 
ATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
C CAT GAAT GGAAGGT GGTATTG CACAG C T AATAAAAT ATGATT T G TGGAT ATGAA 
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FIGURE 72 



MI^/RRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEWAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPG 1 1 YGHHDAVPPLE QQLVIKKETG FWRDFGFGMTCQYRS DFINIGG FDL D IKGWGGEDVHLYR 

KYLHSNLIWRTPYRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNELA.SHGQLGMLVFRHEIEAHL 
RKQKQKT S S KKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites* 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE Z3 

GAGACTGCAGAGGGAGATAAAGAGAGAGGG CAAAGAGGCAGCAAGAGAT T T G TC C TGGGGAT C C A 

GAAACC CATGAT AC CC TAC TGAACACC GAAT CCCCT GGAAGCCCACA.GA.GACAGAGACAGCAAGA 

GAAGCAGAGATAAAT ACAC TCAC GC CAGGAGC TCGCTCGCTCTCTCTCTCTCTCTC TCAC T C C T C 

CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 

CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 

GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 

GGAAACAAT GCC CAG T CGCCCAT CGATATTCAGACA.GACAG TG T GACATT T GACCC TGAT T T GCC 

TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 

ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 

GCC CAGCT C CAC CT GCAC T GGGGTC AGAAAGGATCC CC AGGGGGGTCAGAACAC CAGATC AACAG 

T GAAGCCACAT T T GCAGAGCTCC ACAT TGTACAT TAT GAC T CTGAT T CC TATGAC AGC T T GAG T G 

AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 

AAT ATAGCT TAT GAAC ACAT TC T GAGT CAC T T GCAT GAAGT CAGGCATAAAGAT CAGAAGAC CT C 

AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 

GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 

ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 

GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 

TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 

TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 

CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAC 

CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 

C CAGAAACAC T GTAGGAGTAG TAAGCAGATGT CC T C CT TCCCCTGGACATC TCT T AGAGAGGAAT 

GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 

ATGAGGAAAT CGCTGTGT TGT TAAT GCAGAGANCAAACT C TGT T TAGTT GC AGGGGAAGT T T GGG 

ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 

C T C C T TAG GAT AAAG AG T T GC TGT T GAAGT T G TATAT T T T T GAT CAAT AT AT T T GGAAAT TAAAG 

TTTCTGACTTT 
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FIGURE 74 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQIN3EAT 
FAE LH I VH YD S D S YD S L S EAAE RPQG LAVL G I L I E VGE TKN I AYEH I L S HLHE VRHKDQKT S VP P 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 

QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 



Important features of the protein: 

T\ 

Ti Signal peptide: 

CP 

:Vs amino acids 1-15 

m 

CO Transmembrane domain: 

— amino acids 291-310 

-"*??.. 

ffjj N-glycosylation site. 

Q amino acids 213-216 

III 

J!*! Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 

TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 

ATGGAC T CCACC AGAGGGT C T TC AAGGAGT TAAAG T TACT T ACAC TGTGCAG TAT T T CAT CACAA 

AT T GGCCCACCAGAGGT GGCAC T GACT ACAGAT GAGAAG TCCAT TTCTGTTGTCC TGACAG C T CC 

AGAGAAGT GGAAGAGAAATCCAGAAG ACC T TCC TG T T T CCAT GCAAC AAAT ATAC TCCAATC TGA 

AG T AT AACGTG T C T G T G T T GAAT ACTAAAT CAAACAGAACG T GGT CC CAGT GTGTGACCAAC CAC 

ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 

AGGGCCCC CTCGCC G T GCTCAGC C T T C TGAGAAGCAGT GTGCCAGGAC T T T GAAAGAT CAAT CAT 

CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 

TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 

GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 

ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 

AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 

GGAAGAGGAGGT GAAACAT T TAGGGTATGCTTCGCATTTGATGGAAATTTT TTGTGAC TCTGAAG 

AAAACACGGAAGGT AC TT C T CT CACC C AGCAAGAG T CCCT CAGCAGAACAATACCCCCGGATAAA 

ACAGTCATTGAATATGAATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGA 

GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 

TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 

CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 

AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 

C CAGACAGGCCACCAGGAGAAAAT GAAACC TAT C T CAT G C AAT TCAT G GAGGAAT GGGGGT TATA 

TGTG CAGATGGAAAAC TGATG CC AACACT TCCTTTTGCCTTT T GT T TCC TGTGCAAACAAG T GAG 

TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 

CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 

GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 

AGCAGT CAAT AAAT G T T T GCC AGAC T GGG TGCAGAAT T T AT TC AGG T G GG T G T 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPSVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPIAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGC TGAAGGAC CACAACGCCACCAGCATCCT G CAG CAG C T GC CG CT GCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
W TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
Jlf CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
|s| TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
|1| CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
W CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 

TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
% % AGGTGACCAAG T GGT T CAATAAC TC TGCAGC T TC CC TGACAAT GCCC AC CC TGGACAACAT C CC G 

Ifl TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
O ATTCATGGTCCTGTTGGACTCTGTGGTTCGTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 

T GATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCC TAAC TCAGGAC 
y ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAA.GTGTT 

TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T T T ACACCAAAGGT GACCAAC T TAT AC T CAAC T TGAAT AAC AT CAGC T C TGAT C GGAT CC AGCT G 
ATGAAC T C T GGGAT TGGC T GG T T CC AACC TGATGTTCT GAAAAACAT CAT C AC TGAGAT CAT CCA 
C TC CATCC TGCT GC CGAACCAGAAT G GCAAAT TAAGATCT GGGG TCCCAG T G T CAT T GGT GAAGG 
C CT T GGGAT TCGAGG CAGC TGAGTC C T CAC TGACCAAGGATGCC C T T G T G C T TAC TCCAGCC TCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 

GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGG I PVLGS LVNT VLKH 1 1 WLKVI TANILQLQVKPSANDQEL LVK I PLDMVAGFNTPLVKT J VE 
FHMT TEAQAT I RMDTSAS G PTRLVLS DCATSHGS LRI QLL YKL S FLVNALAKQVMNLLVPS LPNL 
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRI QLMNS G I GW FQ P D VLKN IITEIIHSILL PNQNGKLRS GVP VS L VKAL G 
FEAAE S S L TKDALVLT PAS LWKPS S P VS Q 



l|f Signal peptide: 



Important features of the protein: 
Signal peptide: 

amino acids 1-21 



N-glycosylation sites. 

1|| * amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

If. amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 



App_ID=10063583 



Page 230 of 320 



FIGURE 7Q 



GAGAGAAGT CAGCCTGGCAGAGAGAC T C T GAAAT GAG G GAT TAGAGGT GT T CAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
C CAAGGGCCTCT GGATGGAATGTGCCACACACAGCACAGGCATCACC CAGT GTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
H CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
y GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
m ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
I , !; CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
If! CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CP CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
*^ GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
jU GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
Ifl GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
O GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
W TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
^ GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 

TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
C C T CCAAAGAAAC T GAT T GGC CCT GGAACC TC CATCCCAC TC T T GT T AT GAC T C CACAGT G T CCA 
G AC TAAT T T G TGCAT GAAC TGAAAT AAAACCAT C CT AC GGTATCCAG GGAACAGAAAGCAGGAT G 
CAGGATGGGAGGACAGGAAGGCAGC C T G GGACAT T T AAAAAAATA 
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FIGURE 80 

MASLGLQLVGYILGLLGLLGTLVMLLPSWKTSSWGASIVTAVGFSKGLWMECATHSTGITQCD 
I Y S T LL GL PAD I QAAQAMMVTS S AI S S LAC 1 1 S WGMRCT VFCQE S RAKDRVAVAGGVF F I L GGL 
L G F I P VAWNLHG I LRDFYS PL VP DSMKFE I GEAL YLG 1 1 S S L FS L I AG IILCFSCSS QRNRSN YY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 4 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
J;J C CAT GGGAAGAGAT AC TCC CCCGG CGAGAGC TGGCACCCC TACT T GGAG CCACAAGGC C T GATG T 

It ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
|f| GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
ly TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
W AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

T't ACAGAGGGCCAGAT CTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGC CCAGCACCCCT CCC 

111 

ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
111 AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
O CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGAT CGTC C TGAAGGAGAAACATAAGAAAGCCT 

2f GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
Jj[ CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 

CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACT T GGT GGAGAT C TACC TC T GGAAGC T GGTAAAAGATGAGGAAAC TGAGGC TCAGAGAG 
G T GAAG T AC C T G G C C C AAG GC C AC AC AG C C AG AAT CTTCCACTT G AC T C AGAT CAAGAAAG T C AG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGACC XAACAGT T GCAGAT ATGAGC T GTATAAT TG T TG TT ATTATATATTAAT AAA 
TAAGAAGT TGCATT ACCC TCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 

MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSC YRLHCPPVHC PQPVTE PQQCCPKCVEPHT PSGLRAPPKS CQHNG TMYQHGE I FSAHELF 

PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S STRCPKAPGRVLVHT S VS P S PDNLRRFALEHEASDLVE I YLWKLVKDEETEAQRGEVPGPRPH 

SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 8* 



GACAGC T GTGTCTC GAT G GAGT AGACTCTC AGAACAGCGC AGTT TGCC C T CC GCTC ACGC AGAGCC TCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTGCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGCAGG AC AAAACT GG T GAAGGAT T CT ATTGCGGAGGGGC GCATCT CT CT GAGGCTGGAAAAC ATT AC T 
Q GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
Q ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 
Cft TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
III ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
Wj CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
"f'l GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
W TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
m G AGAAGAAAGCACGGAC AGGC AGAAT TGAG AGACGC CC GGAAAC ACGC AGTGGAGGT G AC TC T GGATCCAG 

i|f« AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
r| CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 
|y CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
O AGGAGT ACGT GACT T T GT C TCCCGAT CATGGG TACT GGGTC C TC AGAC TGAAT GGAGAAC AT T T GT ATTT C 

111 ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CT GACATTACATTT AGTT TGCTCT CACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 



App_n>=10063583 



Page 23 5 of 320 

i 



FIGURE 8a 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIMGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGS VPL I S ITGYVDRDIQLLCQS SGWFPRPTAKWKGPQGQDLS TDSRTNRDMHGLFDVE I SL 
TVQENAGSISC5MRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
H LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 

PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

m 

III Signal peptide: 

iff amino acids 1-17 

Transmembrane domain: 

t*% amino acids 239-255 



w 



App_ID=10063583 



Page 236 of 320 



FIGURE 8* 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGAG&TGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
|f TACTTCTTTCGTATGGAGAAAGGAAGTATAAM.TGGAATTATAAACATCACCGGCTCTCTGTGAA 
IS TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
fp AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
%ll GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
Wl GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
•^f ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
Q GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
Ul GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 

WJ C TGAGGGATGCAGC T GAAT T CAC C T GC AGAGC T CAGAACC CTCTCGGCTCT CAGCAGG TC TACC T 

I - £ 

5|; GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
Jjt CCACAGCGCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 

TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACTGGAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
G AGG CTGATTCTTG T AGAAT T AAC AG C C C T C AAC G T GAT G AGC TAT G AT AACAC TAT GAAT TAT G 
T GC AGAGTGAA&AGCACACAGGCT T TAGAG T CAAAGTAT C T CAAACCTGAAT CC ACAC TG TGCCC 
TCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLS FCVI FWVRS CRKKS ARPAAG VGD T G I E 
DANAVRGS AS QGPLTEPWAEDS P PDQPPPASARS S VGE GE LQYASL S FQMVKPWDSRGQEAT DTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids. 351-370 
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FIGURE 87 

AGAAAGC TGCAC T C T GT T GAGC T CCAGG GC GCAG T GG AGG GAGGGAGT GAAG GAGC TC T C TG TAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
T C T TCG TC TC CAT CT C T GCCCAGAAGC TGCAAGGAAAT CAAAGAC GAAT G T CCTAGTGCAT T TGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
H GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
w CAAC TACAACAC C T T T GGAT C TGCAGAGGC GGCCAC GAGC GATGACT ACAAGAACC CT GGC TAC T 

Jf ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
Ui AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
HI GTTTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGACTGACAACGGCC 
CO CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
^ GGCCAGCGGGAAT TCACTGCGGGAT T T GT TCAGTTCAGGGTAT TTAATAACGAGAGAGCAGCCAA 

% % CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
*h GATACTTT CCAGAGGCCAGTCCCCAGCAGTGTGGAGATTT T TC TGGT TT TGAT TGGAGTGGATAT 

Q GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
W TTGAGAGT T T TGTGGGAGGGAAC CCAGACCTC TCCT CCCAACCATGAGATCCCAAGGATGGAGAfi 

J** CAACTTACCCAGTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGA 
311 AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYMTFGSAJEAAT 
S DDYKNPGYYD I QAKDLGI WHVPNKS PMQHWRNS S LLRYRT DTGFLQTLGHNLFG I YQKYPVKYG 
EGKCWTDNGPVIPVVTDFGDAQKTASYYSPYGQREFTAGF\ r QFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

I™ Important features: 

|«J Signal peptide: 

in amino acids 1-16 

z ;: 
~i :■ '■' 

UP N-glycosylation site. 

J-^ amino acids 163-167 

% 

Q Glycosaminoglycan attachment sites. 

|l| amino acids 74-78, 289-293 

ill N-myristoylation sites. 

p| amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
C6CCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTGCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
D - TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 

Jrf TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCGGCTTCGTATTCCTGGAAGAAGGAAG 
|:I GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
III TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 

19 T TAAGAC T TATATACAG T T T TAGGGGACAAT TAAAAAAAAAAAA 

W 
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FIGURE QO 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVMSFFIFERWECTTYWYIFVFCSALPAVTEMALEV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAG ATGC AACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATG CCACAGGC CAGGGAAACAT CTCCAT CAGC C TC GTGCCC CC CAGTAAAGCT GTAG 
AGT TCCACCAGGAACAGCAGATCT T CATCGAAGCCAAGGCC TCCAAAATCTTCAAC TGCCGGAT G 
GAGTGGGAGAAGG TAGAACGGGGCC GC CGGACCTCGCT T TGCACCCACGACCCAGCCAAGAT CT G 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
C C T TGGT TC T TGCCAT CC T GAGGAAAGATAGCAACAGGGAGGG GGAGAT T T CATCAGTG TG GAC A 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATC C T TAAGC TAAGACAGGACGAT T G T GG TCC TCCCACACTAAGGC CACAGC CC AT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNI S I SLVPPSKAVE FHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKI CS RDHAQS S AT W SC S QP FKWC VY I AFYS TDYRLVQKVCPDYNYHSDT PYYPS G 



iiU! 



5-U 



Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCCATGA.CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACTTCCCAAACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCT 
T TT TCTGAAAATCCCTTTTTCTGGT GGAAT TGAGAAAGAAATAAAAC TATGCAGATA 
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FIGURE qa 

MTAAVFFGCAFI AFG PALAL YV FT I AI E PLR 1 1 FL I AGAF-FWLVS LL I S S LVW FMARVI I DNKDG 
PTQKYLLIFGAFV5VYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVS AQT F I S S YYG INLAS AF 1 1 LVLMGTWAFLAAGGS CRS L KL CL LCQDKNFLL YNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
G AC C C AAAG G TAG C AAT C T GAAACATGAGG AG T AC G AT T C T AC TGTTTTGTCTTC TAG GAT C AAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACAC TACCAAAC CAACAGCAGTCAAATC AGG TC T TTCCTTCTT TAAG T C T GATACC AT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT T GATTATAT TATGGAATAGATTGAGACACAT TGGATAGTCT TAGAAGAAAT TAAT TCTTAATT 
T AC C T GAAAAT AT T C T T GAAAT T T CAGAAAATAT GT T C TAT GT AGAGAAT C CCAAC T T T T AAAAA 
CAATAATTCAAT GGATAAATCTGTCT T TGAAATATAACATTATGC TGCCTGGATGATATGCATAT 
T AAAACATATT T GGAAAAC TGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 06 



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPMQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
T DDDFAVT T PAG I QRSTHAIEEAT TE SANG I Q 



Signal peptide: 

H amino acids 1-16 

o 
o 

Ml 
If! 

m 

Id 

o 
m 
o 
w 
a 
rii 
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FIGURE Q7 

GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 

CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 

GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 

AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 

CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG- 

GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 

GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 

AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 

AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 

GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 

CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 

AC C AGC C T TGAC CACGCCC T GGAGAGGACC TACCAG C T G T T GG TACAG GTC AAGGACAT GGGT GA 

CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 

C CCTAGAGCC TAT C CACC T GGCAGAGAATC T CAAAGT CC TATACCCGCACC ACATGGC C CAGG T A 

CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 

T GCAGAGG GAAACCTC TACGTGACCAGAGAGCT GGACAGAGAAGCCCAGGC TGAGTAC C T GC TC C 

AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 

ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 

CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 

ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 

CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 

CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 

TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 

ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 

CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 

GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 

GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAA.GCTGGTGGGGCCAGGCCC 

AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 

ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 

CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 

ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 

TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 

GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 

GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 

GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 

CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 

CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 

AT T T TCACCCAC TGGAC CATG TCAAG GAAGAAGGACCCGGATCAACCAGCAGACAGCG T GCCCCT 

GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 

T CCCC TGGGAGAGAG CC CAGCAC CCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCC T CCA 

TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 

TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 

CTGGGCTGGGCCCTATGGGAAAAAAAA&AAAAAAAAAAA^^ 
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3 a 

amino acids 1-18 



FIGURE Q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAT 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS I IESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWWVQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHI IPVWSHNAQMWQLLVRVI VCRCNVEGQCMRKVGRMKGMPTKLSAVG I LVGTLVAI G I FL I 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 



Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 

GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AG T CCAAG AT TC T T CC CAGGAACACAAACGTAGGAGACCCACGCT CCT GGAAGCAC CAG C C T T T A 

TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATC TAGGACCCAGGCAT C T T GC T T TCCAGC CACAAAGAGACAGATGAAGAT GCAGAAAGGAAAT G 
T T C TCC T TATG T T T GGT CTACTAT TGCAT T TAGAAGC T GCAACAAAT T CCAAT GAGA.C TAGC AC C 
T C T GCCAACACT GGATC CAGTGT GA.TC TCCAG T GGAGCCAGCACAGCCACCAAC T C T G GGTC CAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACT C T GAGT T C CATACAACC T C CAGT GGGATCAGC ACAGC CACCAAC TC T GAG 
T T CAGCACAGCGT CCAG TGGGAT CAGCATAGCCACCAACT C TGAGTC CAGC ACAACC TCCAGT GG 
GGC CAGC ACAGC CACCAAC T C T GAGT C CAGCACACCC T CCAGTGGGGCC AGCACAGTCACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAACT CTGAC TCC AGCACAG T G T C CAGT GG GGCCAGCACTGCCACCAACT C T GAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
C TGAGT CCAGCACAG T GT CCAGT GGGATCAGCACAGT CACCAAT T CT GAG T CCAGCACACCC T CC 
AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
CAACT C TGAGTC CAGCACAGTG T CCAG TGGGGCCAGCACT GCCACCAACTC TGAGTCCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCAC C AACTCT GAC T C CAGC ACAACC T CCAG TGAG GC CAGCACAGC CACCAAC TC TGAG TC TAG 
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGTC C TAACT GGTTC TGGAGGAGACCAGTATCAT CGATAGC CAT GGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT C T T G AAG AAG GT AT T C C T C AC CTTTCTTGCCTT T AC C AGAC AC T G G AAAG AGAAT AC TAT AT 
T GC TCAT T TAG CTAAGAAATAAATACATC TCAT C TAAC AC ACACGACAAAGAGAAGCT GT GC T T G 
C CC CGGGG TGGGTAT C TAGCT C T GAGATGAAC T CAGT TATAGGAGAAAAC C T CC ATGC T GGAC T C 
CAT C T GGCATT C AAAATC TCCACAGTAAAATCCAAAGACC T CAAAAAAAAAAAAAAAAAAA&AAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 

MKI4QKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSES S T TS SGAS TATNS ESS TVS SRAS TATNS E S S TT S S GAS TATNSESRTTSNGAG TATNS ES 

STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNS ESS TVS S G I S T VTNS ES S T PS S GANTATNSES S T TS S GANT ATNS ESS TVS S GAS T A 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
i*| STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
Q AVGLFAGL FFCVRNS LS LRNT FNTAVYHPHGLNHGLG PG PGGNHGAPHRPRWS PNWFWRRPVS S I 

|l! AMEMSGRNSGP 



. : 

ill 



Signal peptide: 

amino acids 1-2 0 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAJSGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATAT ATG CATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGT T TCTAGCTACTGCACGTACT TCATAAAGCAGGACTC TAAAAGCT TTGGAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GATTATAACTGGGGATGCATTTGTTCCTGGAGAAAGAAGTGTCATTATCATGAACCATCGGACAA 
GAATGGAC TGGATGT T CCT GTGGAAT TGCCTGATGCGATATAGC TAG C T CAGAT TGGAGAAAAT T 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CT T CAT TCATAGGAAATGGAAGGAT GACAAGAGCCAT T TCGAAGACATGAT TGATTAC TT TTGT G 
ATATTCACGAACCACTTCAACTCCTCATATTCCCAGAAGGGACTGATCTCACAGAAAACAGCAAG 
T CT CGAAGTAAT GCAT T TGC TGAAAAAAAT GGAC TT CAGAAATAT GAAT AT G T TT TACAT CCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATAT CACT GTGGCGTATCC TCAC AACAT T C CTCAAT CAGAGAAGCAC CT CC T CCAAGGAGAC TT T 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
C C T T CAAC TCTGG T GCCACAAACGGT GGGAAGAGAAAGAAGAGAGGC T GCGTTC C T TC TATCAAG 
GGGAGAAGAAT T T TTAT TT TACCGGACAGAGTGTCAT T CCACC T TGCAAGTCTGAACTCAGGGT C 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATAT T TGTACAGTCT T GT TAAGTGGT AT T T TATAAT CACCAT TGTAATCT T T GTGC TGCAAGAGA 
G AATAT T T G G T G GAC T G GAGAT C AT AG AAC T T G CAT G T T AC C G AC T T T T AC ACAAACAG C CACAT 
T T AAAT T C AAAG AAAAAT G AG TAAG AT T AT AAGG TTTGCCATGT G AAAAC C TAG AG CAT AT T T T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCAT TAT T TGT T AAAGATAT T T T GCAC T TAAT T TTGT GGGAAAAATAT T GCTACAAT T TT T 
T T TAATC TCT GAAT G T AAT T TCGAT AC TGTGTACATAGCAGGGAG T GAT CGGGG T GAAATAAC T T 
GGGCCAGAATATTATTAAACAATCATCAGGCTTTTAAA 
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FIGURE 102 



MHSRGRE I WLLNPWS INEAVS S YCTYFIKQDSKS FG IMVSWKG I Y FI LTLFWGS FFGS I FMLSP 
FLPMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI I TGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA' 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPT3KEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LE I I ELAC YRLLHKQPHLNS KKNE 



Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACC TC TACAGAGAT GGGGAAGACT GGGAATCTAAGCAGATGC CACAG TAT CGAGGGAGAACT GA 

GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTMGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
U> GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
r\ . AGGAT T T G T C T T CAGAC T C CAGAGCAAAT GCAGATGGGTACAGCC TGTATGATGTGGAGATC T CC 

E ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
J;I GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 

TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
III AAATCCAAAGGGAAAATCCAGGC GGAACTGGAC TGGAGAAGAAAGCACGGACAGGCAGAAT T GAG 

HI AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
m TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
ft TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGT TTCCAAGCAGGGAGACAT TAC TGGGAGG T GGA 

W CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
9 ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
Q ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
5H GGAC TAT GAGGGTGGGACCATCTCC T TCT TCAATAC AAAT GACCAGT CCC T TAT T TATAC CC TGC 

Zl T GA.CATGTCAG T TT GAAGGC T T GT T GAGACCC TATATC CAGCAT GCGAT G T ATGACGAGGAAAAG 

" GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
III C CC ACAC CACAGAC CCAGACACAGC CAAGGGAGAGT GC T C CCGACAGG TGGCCC CAGC T T CCTCT 

III CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
m TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
s ** T CAGAAGC CAT GGCTG C CC TGAAGT GGGGACGGAATAGAC TCACAT T AGG T T TAGTT T G T GAAAA 

CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAG TGAT TCCT GCCTCACAGGTGAAGATTAAAGAGACAACGAATGT GAAT CATGCTTGCAGGT T 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
T GC T TAT TCCAC AT TAAT T TACT T T TCTCTATACCAAATC ACCCATGGAAT AG T TAT T GAACAC C 
T GC T T TGT GAGGCT CAAAGAATAAAGAGGAGG TAG GAT T T TT CACTGATT C TAT AAGCCCAGCAT 
TAC C T GAT AC C AAAAC CAG GCAAAGAAAAC AG AAG AAG AG GAAGGAAAAC T ACAG G T C CAT AT C C 
C T CAT TAACACAGACAC AAAAAT TC T AAATAAAAT T T T AACAAAT T AAAC T AAAC AATAT AT T T A 
AAGATGATATATAACTACTCAGTGTGGTT TGTCCCACAAATGCAGAGTT GGTT TAATAT T TAAA.T 
AT C AAC CAG T G TAAT T CAG CACAT T AAT AAAG TAAAAAAG AAAAC CATAAAAAAAAAAAAAAA 
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FIGURE lozt 



fl 



ro 



MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
S KGK I QAE LDWRRKHGQAE LRDARKHAVE VT LDPE T AHPKL CVS DLKT VTHRKAPQE VPHS EKRF 
T RKS WAS QGFQAGRHYWEVDVGQWGW WGVCRDDVDRGKNOTTLSPKfNG YWVLRLTTEHL YFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE 1Q5 

CCTTCACAGGACTCTTCATTGCTGGTTGGCA ATX3A TGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GTTC AT TATGTGAG AT AT AAT C AAAAGAAGACCT AC AATT AC TAT AGC AC AT TGT C ATTT AC AAC 
T G A.CAAAC T AT ATGCT GAGTTTGGCAGAGAGGCT TC TAAC AATTTT AC AGAAATG AGCCAGAGACT TG AAT 
C AAT GGTG AAAAAT GC AT TTTAT AAAT C TC CATT AAGGGAAG AAT T TGT C AAGT CT CAGGTT ATC AAGT T C 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACTGTAGAT AAAATTGT TCAACTTGTT TTACATGAAAAGCTGCAAGATGCTGT AGGACCCCCTAAAGTAG 
H AT CCTC ACTCAGTT AAAATTAAAAAAAT CAAC AAGACAGAAAC AGACAGCTATCTAAACCAT TGCTGCGGA 

O ACACGAAGAAGTAAAACTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAGAAGTAGAAGAGGGTGAATG 
O - GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 

T T GT GAGTGCTGCTCACT GTTT T ACAAC AT AT AAGAACCCT G CC AG AT GG AC T GCT T CCT TT GG AGTAAC A 

*JJ ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA 
Zl TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
?n TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 

GATGGT TACAGTCAAAATC ATCTT CGACAAGCACAGGTGACTCTCAT AGACGCT ACAACT TGCAATGAACC 
P TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 
|fl G CCAGGGTGACT CT GGAGGACCACTGGT T AGTTC AGAT GCT AGAG AT ATC TGGT ACCTT GCT GGAAT AGT G 

Ql AGCT GGGGAG ATGAAT GTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTT ACGGCCTT GCGGGACTG 

W GA.TT ACTTCAAAAACTGGTATCTAAGAGACAAAAGCCT CATGGAAC AGAT AAC ATTTTTT TT TGTTTTTT G 

"W GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 

ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACTCT GTCAT C TGT GAG CAAT AGT TGAAACT TT ATGT AC AT AG AGAAAT AG AT AATAC AAT AT T AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GC AT AT AT ACAAT TTGAAGC ACT C CT TT T C T T CAGT TCCT C AGCTCCT CT C AT T TC AGCAAAT ATCC AT T T 
TC AAGG T GC AGAAC AAGGAGTG AAAGAAAAT ATAAGAAGAAAAAAAT C CC CT AC AT T T TAT T GGC AC AG AA 
AAGT AT T AGGTG T T T T TCT T AGTGG AAT ATTAGAAAT GAT C AT AT TCATT ATGAAAGGT C AAGC AAAG AC A 
GC AGAAT ACC AAT CAC T T C ATCAT TT AGGAAGT ATGGGAACT AAGTT AAGGAAGTCCAG AAAGAAGCC AAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
AT AATAATT ATACAAACT TC ATGC AAT GT ACT TGT T CT AAGC AAATT AAAGCAAAT ATTT AT TT AACATT G 
T TACT GAG G ATGT CAAC AT AT AAC AAT AAAAT AT AAAT CAC CCA 
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FIGURE 106 



MM YRP DWRARKRVC WE P W V I GL VI F I S L I VLAVC I G L TVHYVR YNQKKT YNYY S T L S FT T DKL Y 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
T E VEEGE W PWQASLQ WDGSHRCGAT L I NATWLVS AAHC FT T YKNPARWTAS FGVT I KP SKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVT L I DAT T CNE PQAYNDAI T PRML C AG S LE GKT DACQGDS GG P LVS S DARD I W YLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 1Q7 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GAT ATGGT CACT T T GAAAGCCAGGAT GACAAAT TAT GGAT TACC TAGAT AT CGG T GGC TT AC TC A 
M :: T GCT TGGAAT T T T T T T CAGAGAGAGT T TAAGTGC TGT GGAG TAG TAT AT T T CAC T GAC TGGT TGG 

O AAATGACAGAGAT GGAC T GGC C C CCAGAT TCC T GC T G T GT TAGAGAAT T CC CAGGAT GT T CC AAA 

Q CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
S TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
fl TCCTGGCCATGATTCTCACCATTACTCTGGTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 

ACAGACCAAATGATG T CCT T GAAGAAT GACAAC TCT CAGC ACCTGTCAT G T CCC T CAGTAGAAC T 
m GTTGAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACT 
K! TTGAGATGGAGGAGT T ATAAA AAGAAATGTCACAGAAGAAAACCACAAACT TGT T TTATT GGACT 

1 1 1 TGT GAAT T T T T GAGT AC AT AC TAT G T GT T T CAGAAAT AT GT AGAAAT AAAAAT G T TGC CATAAAA 

TAACACCTAAGCAT ATACTAT T CTAT GCT T TAAAATGAG GATGGAAAAGT T T CATG T CAT AAG T C 
* Wi ACCACCTGGACAATAATTGATGCCCT TAAAATGC TGAAGACAGAT GT CATACCCACTGTGTAGCC 

TG TGTAT GACT TT TACTGAACACAGT TATGT T T TGAGG CAG CATGGT T T GAT TAGCATT T CC GC A 
|fl T CC ATGCAAACGAGTCACATAT GGT G GGAC TGGAGCCATAG TAAAGG T TGATT TACT TC T AC CAA 

Q C TAG TAT AT AAAG T AC T AAT T AAAT G C T AAC AT AG G AAG T T AG AAAAT AC T AAT AAC T T T TAT T A 

f =; CTCAGCGATCTATTCTTCTGATGCTAAATAAATTATATATCAGAAAACTTTCAATATTGGTGACT 
JJf ACCTAAATGTGATTT T TGC TGGTTACTAAAATATTCT TACCACTTAAAAGAGCAAGCTAACACAT 

Q T G T C T T AAGC T GAT CAGGGAT T T T T T GT AT AT AAG T C T G T G T T AAAT C T G T ATAAT T C AGT C GAT 

ffj T TCAGTTC T GAT AAT GT TAAGAATAACCAT TATGAAAAG GAAAAT T T G TCCTGTAT AGCATCAT T 

ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TAT T T AAAT AC T T AAC CAC T AAT T T T GAAAAT TACC AG T G TGAT AC AT AGGAATC AT TAT T C 
AGAATGTAGTC T GGT C T T TAGGAAGTAT TAAT AAGAAAAT T TGCACATAAC T TAGTTGAT T CAGA 
AAGGAC T T GTAT GCTGTTTTTCTCC CAAAT GAAGAC T CT T T T TGACACTAAACAC T T T T T AAAAA 
GCT TATCT TTGCCTTCTCCAAACAAGAAGCAATAGT CTCCAAG TCAATATAAAT TCTACAGAAAA 
T AGT GTTCTTTTTCTC CAGAAAAAT GC T TGT GAGAAT CAT T AAAAC AT GT G ACAAT T T AGAGAT T 
CTTTGTTT TAT T TC AC TGAT TAAT AT AC T G TGGCAAAT TACACAGAT TAT T AAAT T T T T T TACAA 
GAGTATAGTATATT TATT TGAAATGGGAAAAGTGCAT T T TACTGTAT T TTGTGTAT T T TGTT TAT 
T TCTCAGAATAT GGAAAGAAAAT TAAAATGTGTCAATAAATAT T T TCTAGAGAG TAA 
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FIGURE 108 



MAREDS VKCLRC LL YALNLL FWLMS I S VLAVSAWMRDYLNNVL TLTAE TR VEEAVI L T Y FPWHP 
VMIAVCCFLIIVGMLGYCGTWRNLLLLAWYFGSLLVIFCVELACGWTYEQELmWQWSDIWT 
LKARMTNYGL PRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVRE FPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRI FEHTSMANS FNTHFEMEEL 



Signal peptide: 

amino acids 1-33 



J 5 ?;; Transmembrane domains: 

s-J amino acids 12-35/ 57-86, 94-114, 226-248 



IS 
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FIGURE loo 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGGCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGTGAATGGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
, : TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
f~ CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
t"% CAGCTGCTCAGATACCAGC GAGTGCAC ATGC GCTCCAGGAACT GTGCTAGGCCCTGACAGGCAGACATGCTTT G 

'-rr.z-.i- 

HI AT GAAAAT G AAT GT G AGCAAAAC AAC G GT GGCT GC AGT GAGAT C T GT GT GAACCT C AAAAACT CCT AC C GC T GT 

Ul GAGTGTGGGGTTGGCCGTGTGCTAAGAAGTGATGGCAAGACTTGTGAAGACGTTGAAGGATGCCACAATAACAA 
If! TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
III CTGAGGATAACCACACTTGGCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 
UJ CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
S CCT CTTCT CT C TC AAGAC AT GT GGT AC AGTGGT C GAT GT GGT GAATGACAAGATT GT GGC CAGC AACC TCGT GA 

y CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
*H ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 

CAT GAGCCGAAATCAT GGGATCT TCCC ATT CACTCT GGAGATCTTCAAGGAC AAT GAGTT T GAAGAGCCTT ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAAT CAGAAGCT GGGTAT AAT AT TTCAAGTT ACAAACCC TAGAAAAAT T AAACAGT TACT GAAATT AT GA 

CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TTATAGGGAATTTGGAAGT GTAT CAAT AAAACAGT ATAT AATT TT 
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FIGURE no 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPVVHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
\Z LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
§ £ g Signal peptide: 

Iff amino acids 1-16 

m 

|l| N-glycosylation sites. 

« amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

IfS 

*-° ff Tyrosine kinase phosphorylation sites. 

J'^ amino acids 411-418, 443-451 

jgjj N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxy! ati on site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE 111 



GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCT GCCTTGGGGT GACAATCT CAGCTCCAGGCT ACAGGG AGACCGGGAGGATCACAGAGCCAGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGC AGCT GT GT GACGGAGAGC TGGACT GTC CCT TGGGGG AGGACGAGGAGC ACTGT GTCAAGAGCTTC 
H CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
O AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
^ GCTACAGC AG AG CTGT GGAGAT TGG CCC AG AC C AGG AT C T GG AT GTTGT T GAAATC AC AGAAAAC AGCCAG 

;■ J GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
7m TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 

m 

TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCA 
i : ii GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 
9 CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
|1: TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
111 ' GATGAGGAGCTC ACTC CAGCC ACC CC AC TC TGGAT CAT T GGATGGGGC T T T ACG AAG CAGAATGGAGGGAA 

U GATGTCTGACATACTGCT GCAGGCGTCAGTCCAGGT CATTGACAGCACAC GGTGCAATGCAGACGATGCGT 

ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
if GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
- ' CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 

GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGAT CCCCCAAAGT C AGACACAG AGC AAG AGTCC CCT T GGGT AC AC CC C T CT GCCC AC AGCCT C AGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATTGCTAAGCCAAGAAGGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 



MLQDPDS DQP LNS LDVKPLRKPR I PME T FRKVG I PI 1 1 ALL S LAS 1 1 1 WVL I KVTLDKYY FLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVS IQYDKQHVCGGS ILDPHWVLTAAiiCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMS D I LLQAS VQVT DS TRCNADDAYQGEVTEKMMCAG I PEGGVDTCQGDS GGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNV>JKAEL 

r% Transmembrane domain: 

pi amino acids 32-53 (typell) 
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FIGURE 113 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTC TGAACAACT T T T T CAGCAAC TAAAAAAGCCACAGGAGT 
T GAACTGC TAGGAT TC TGAC TATGC T GT GG TGGC TAG T GC TCC TAC T CCT ACCT AC AT TAAAAT C 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
C AGAAGGGGTC TGC T T AGACC AC CT GGT T TAT GT GACAGGAC T T G CAT T CT CCT GGAACAT GAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
C T C AAT T T AAAT CAT G T T C T AGT AAT T G GAGC T G T C C C CAAGAC C AAAG GAG C T AGAG C T T GG T T 
CAAAT GAT C TCCAAGGGCC CT TATACCCCAGGAGAC TT TGAT T T GAAT T TGAAACCCCAAAT CCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAZyVACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTAC TCGGGAGGC TGAGAC AGGAGAAT TACTT GAAC CTGGGAGGTGAAGGAGGC TGAGACA 
G G AG AA T C AC T T CAG CCT GAG C AAC AC AGC GAGAC T C T G T C T C AG AAAAAAT AAAAAAAG AAT T A 
TGGTTATTTGTAA 
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FIGURE 114 
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MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 
amino acids 1-15 
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FIGURE, lis 

C AGCAGT GGTCTCT CAGT CCTC T CAAAGCAAGGAAAGAG T ACTGTGT GCT GAGAGACCATGGGAA 
AGAATCC TCCAGAGAATTGTGAAGAC TGTCACATTCTAAATGCAGAAGCT T TTAAATCGAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
G T T T T GGGGG AG CAAGCACT TC T GGCCGGAGG T ACC CAAAAAAGCC T AT GACAT GGAGCACAC T T 
T C T AC AGC AAT G G AGAGAAGAAGAAG AT T T AC AT G GAAAT TGATCCTGT G AC CAGAAC T GAAAT A 
T TCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACT TTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C TGAACCAGAAGAGGAAATAGAT GAGAAT GAAGAAAT TACCACAACTT T CT T TGAACAGTCAG T G 
AT T TGGGTCCC AGC AG AAAAGCC TAT T GAAAACC GAG AT T T T C T TAAAAAT T CC AAAAT T C T G G A 
GAT T T G T GAT AAC G T G AC CAT GT AT T G GAT C AAT C C C AC T C T AATAT C AG T T T C T GAG T T AC AAG 
ACT T T GAGGAGGAGGGAGAAGAT CT T CACT T T CC T GCCAAC GAAAAAAAAGG GAT TGAAC AAAAT 
GAACAGT GGGTGGTCCCTCAAGT GAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGT GAGGA 
AGAAC T T CC AAT AAAT GAC TAT AC T G AAAATG GAAT AGAAT T T GAT C CC AT G CT G GAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGdTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSE PEEE I DENEE I T TT FFEQSVI W VPAEKP I ENRD FLKMSKILE I CDNVTM YW INPTL.I S VSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWVPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCC I YCRRGNRYCRRVCEPLLG YYPYPYC YQGGRVI CRVIMPCNWWVARMLGRV 

Important features of* the protein: 
Signal peptide: 

amino acids 1-4 0 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 



V: ■:■} 

f|l 242 



Glycosaminoglycan attachment sites . 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 



N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies Oterminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 

GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACT.CTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
M TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
% J GATGGT GCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 

TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 

CP 

fl GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
m C T T C C AAGCACG ACT ATGTGTAAT GC T C T AAG AC CT CT CAGC ACGG GC GGAAGAAACTCC CG GAGAGCTC A 

|yi CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
5 CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
C| GCTGAGT T AT TT AT GAAT T AG AGGCT AT AGCT C ACATT T TC AATCCT C TAT T T CTT T TTT T AAAT AT AACT 

IJJ TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
C3 TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
^ GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 

AC AC TT ACTG AAGAAG AAGC AATAAG AGAAAG AT AT T T GT AATCTCT C CAGC CC AT G ATC TCGGTTTTCTT 

ru 

ACACTGTG ATCTTAAAAGTT ACCAAACC AAAGTCAT TT TCAGTTTGAGGCAACC AAACCT TTCT ACTGCT G 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA 
AATAAAT AATGTTT T AGT AAAATGAT AC AC TATCT CTGT GAAAT AGCCTC AC CC CT ACAT GT GG AT AG AAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
ATGAAAAGCTCACACC TGT AATCC T AGC ACTT TGG GAGGC T G AG G AGGAAGG AT C ACTT GAGCCC AGAAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
T GGT GGC AT ACACCT GTAGTCCCAGC AT TC CGGG AG GC T GAG GT GGGAGGAT C AC T TG AGCC C AGGGAGGT 
TGGGGCTGC AGTGAGCCATGATCACACC ACTGC ACT CC AGC C AGGTGACAT AGC GAGATC CT GTCT AAAAA 
AAT AAAAAAT AAAT AATGGAAC AC AGCAAGTC CT AG GAAGT AGGTT AAAACT AATTCTT T AA 
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FIGURE 118 

MS T T T C QWAFL LSI LG LAGC I AAT GMDM W S TQDL Y DNP VT S VFQ YE GLWRS C VRQS S G FTE CRP 
YFTILGLPAMLQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAIAGVSVFANMLVTNFWMS TANMYTGMGGMV QTVQTRYT FGAALFVGWVAGGL TL IGGVMMCI A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 

V 



Signal peptide: 

f*j amino acids 1-23 

I 

y? Transmembrane domains: 

J|! amino acids 81-100, 121-141, 173-194 

Wl 

m 

W 

13 
If! 
O 

w 
O 
ill 
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FIGURE HQ 



G GAAAAACT GTTCTCTTCTGT GG C ACAG AGAAC CC T G CT T CAAAG CAGAAG TAGCAG T T CCGGAG T C C 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAA.CCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGT GCAC GGGGGACAATGAGAAGGT GAAGGCTCACATT CTGCT GACGGCTGGAAT CATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
C AGATAC TCGATACCT T CCC ATCGCACAACCCAAAAAAGT TAT CACACCGGAAAGAAGT CACC GAGCG 
T C T AC T C CAGAAG T C AG TAT GTG TAG T T G T GT AT GT T T T T T T AAC T T T AC T ATAAAGCC AT GC AAAT G 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AAT CT TAATTACAGGAACT GT GCATCAGCTATT TATGAT TCTATAAGCTAT TTCAGCAGAATGAGATA 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
Q AC G AC AT AG CAT TAT G T AC AT AG AT GAG T GT AAC ATT TATATCTCACATAGAGACATGCT TATATGGT 

||j T T T AT TT AAAAT GAAAT GC C AGT C CAT T ACAC T GAAT AAAT AGAAC T C AAC T ATT GC TT T TCAGGGAA 

O ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 



If! 



TT TAT AAT GAAGAT T AAAAT GAAGGCTT TAAT C AGCATT GTAAAGGAAAT TGAATGGCTT TCT GATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGT GT ATTAAATTAACAT T TT T AAAAC G C AG AT AT T T T GTCAAGGGGCTT T GCAT TCAAACT GCTT 
T TCCAGGGCTAT ACTCAGAAGAAAGAT AAAAGT GTGATCT AAGAAAAAGTGAT GGT T TTAGGAAAGTG 
AAAAT AT TTTT GTTTTT GTAT TTGAAGAAGAAT GAT GCAT T T T G ACAAGAAAT CAT AT AT GTATGGAT 
AT AT T T T AATAAGTAT T TG AGTAC AGACT TT GAGGTT TCATCAATAT AAAT AAAAGAGC AGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTT TAGTTTTACTAAAATCTGTAAATACT GTATTTTTCT GTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACAT T TAT AT TAAT AAAT TG T ACAT T T T T C TAAT T 
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FIGURE 120 



MATHALEIAGLFLGGVGIWGTV7OTVMPQW 

I YDS LLALS PDLQAARGLMCAASVMS FLAFMMAI LGMKCTRCTGDNE KVKAHI L LTAG 1 1 FI I TG 
MWLI PVSWVANAI I RDFYNS I VNVAQKRE LGEALYLGWT TALVL IVGGAL FCCVFCCNEKSS S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
T ACAAAGATGCGT T CAAATAGT GC T C T AAGAGT T TT G T TC AGTGGC T CACT T CGGC TAAAATGCA 
GAAAT GCATGC T G T CAGCG T TGGTAT T TCACAT T CAAT GGAGC TGAAT GT T CAGGACC T CT T CCC 
AT T GAAGC TAT AAT T TAT T T GGACCAAGGAAGCC C T GAAATGAAT T C AACAAT TAAT AT T C AT CG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
T TGGCACT TGT T CAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTT TC TCGCATC 
AT TAT T GAAGAAC T ACC AAA ATAAA T GC T T TAAT T T TC AT TTGCTAC CTCTTTTTT TAT TATGCC 
T TGGAATGG T T CACT T AAAT GACAT T T TAAATAAGT T TAT G TATACAT C T GAAT GAAAAGCAAAG 
C TAAATAT G TT T ACAGAC C AAAGT GT GAT T T C ACAC T GTT T T TAAAT C T AGC AT TAT T CAT T T T G 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAA^AGC T ACCAATC T T T GTACAAT T TGTAAAT GT TAAGAAT T T T T T T TATAT CT GT 
T AAATAAAAAT T AT T T CCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVL FS GS LRLKCRNAC CQRWYFT FNGAECS GPLP I EAI I YLDQGS PEMNS T I N I HRT S 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATT GC C T GGAAGAATACATCATGTT T T TCGATAAGAAGAAAT T G TAGGAT C CAGT TT T T T T T T TA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGG ATG GGTTTCAATG 
T AAT T AGGC TACT GAGCGGATCAGCT GTAGCAC TGGT T AT AGCCCCGACTGT CT TACTGACAAT G 
, : CTTTCTTCTGCCGT^ACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
12 ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
U ATAACAGC C TT CAAAAAC T TAAGTAT AAT CAAT T TAAAGGGCTCAAC CAGC T CACC TGGCTATAC 

O CT T GACCAT AAC CATATCAGCAAT AT T GAC GAAAAT GC TT T T AAT GGAATACGCAGAC T CAAAGA 

|f| GC T GAT T C T TAG T T C C AAT AGAAT C T C C TAT T T T C T T AAC AAT AC C T T C AG ACC T G T G AC AAAT T 

711 TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
?!: CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
yj AGACTGCCGCAACCTGGAACT T T TGGACCT GGGATATAAC C GGAT CC GAAG T T TAGCCAGGAATG 

CO T CT TT GCTGGCATGAT CAGAC T C AAAGAAC TT CACC TGGAGCACAAT CAAT T T TCCAAGC T CAAC 

|J CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
"J CATAGGACAGACCATGTCC TGGACCTGGAGCTCC TTACAAAGGCT TGAT TTATCAGGCAATGAGA 

TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
W TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
HI TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
P TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
|Ts AT CGATGC AGT GAAGAAC T ACAGCAT C TGT GGCAAAAGTAC TACAGAGAGGT T T GATC T GGC CAG 

t£ GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
y TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
III TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 

CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTC CC TAAAGCAAAT GACT CCCAGC AC CC AGGAAT T T TAT G T A 
GATTATAAACCCACCAACACGGAGACCAGC GAGATGC TGC T GAAT GGGACGGGACC CT GCACCTA 
TAACAAAT CGGGC T CCAGGGAGT GT GAGGTATGAACCAT T G TGATAAAAAGAGC T C T TAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CAT TTTCCTCT CATAC ATAATCAACCCAT T GAAAT T T AAATACC AC AAT CAATGT G AAGC T T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
GTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAAAA 
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FIGURE 124 

mgfnvirllsgsavalviaptvlltmlssaergcpkgcrcegkmvycesqklqeipssisagclg 
lslrynslqklkynqfkglnqltwlyldhnhisnidenafngirrlkelilssnrisyflnnt.fr 
pvtnlrnldlsynqlhslgseqfrglrkllslhlrsnslrtipvrifqdcrnlelldlgynrirs 
larnvfagm i rlke lhlehnqfsklnl al fprlvs l qnl y lqwnk i s vi g qtms wt ws s lqrldl 
sgneieafsgpsvfqcvpnlqrlnldsnkltfigqeildswislndislagniwecsrnicslvn 
wlks fkglrent 1 1 cas pkelqgvnvi davknys i cgks tter fdlaralpkpt fkpklprpkhe 
skpplpptvgatepgpetdadaehisfhkiiagsvalflsvlvillviyvswkrypasmkqlqqr 

SLMRRHRKKKRQSLKQHTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 4 65-4 68 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myri s toylation si tes . 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 



CCGTTATCGTCTTGCGCTACTGCTGA ATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAA7\AACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T GAAGAGGAAGGC T T TC TAAAGC T T T GGCAAGGAG T GAC AC CCGC CAT T T ACAGACACG T AGT 
GTAT T C T GGAGGTC GAATGGTCACATAT GAACATCT CCGAGAGGTT GTGT T TGGCAAAAGT GAAG 
, . ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
p d T TAGCC AAT C CAACT GACCTAGT GAAGG T T CAGAT GCAAATGGAAGGAAAAAGGAAAC T GGAAGG 

Q AAAACCAT TGCGAT TT CGTGGT GTACAT CATGCAT T T GCAAAAATC T TAGC TGAAGGAGGAAT AC 

GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
f£ ACCACTTATGATACAGTGAAACACTACTTGGTATTGAATACACCACTTGAGGACAATATCATGAC 
%l TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
|J TCAAAAGCAGAATAAT GAATCAACCACGAGATAAACAAGGAAGGGGACTTT TGTATAAATCATCG 

9 ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
13 ATCT T GGC TGAGAAT GACC CC T T GGT CAATGG TGTTCTGGCT TAC T TAT GAAAAAAT CAGAGAGA 

f2. T G AG T G GAG T CAG T C C A T T T TAA 

m 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELAT FPL DLTKTRLQMQGEAALARLGDG ARES 
APYRGMVRTALG 1 1 EEEGFLKLWQGVT PAI YRHWYS GGRMVTYEHLRE WFGKS E DEHYPLWKS 
VI GGMMAGVI GQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKI LAEGG I RGLWAGWVP 
NI QRAALVNMGDLT T YDT VKHYLVLNTPLE DNIMTHGLS S LCS GLVAS I LGT PADVI KSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

!„_.-. Transmembrane domains: 

□ amino acids 25-38, 130-147, 233-248 

m 

y i 
ill 



hi 

o 
FU 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
z , TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
t*% GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
Q AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
*? 3 CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
™ GAGGCAAGAGCT TC GGGAGGCCCAGAC CCACATC CGAGCC ACG T GCAT C T C TCCAGGT G T GGT GG 

%l AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
lil ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
* CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
J-f TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
JLf CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
iH TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
n TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
fll T CT TCCC T T TGA.CAT GG GAAAGG AG T T G TGGCC AAAAT CCC CAT C T T C T T GCACC T CAAC G T C T G 

TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGME RWRDRLALVTGASGG I GAAVARALVQQGLKWGCARTVGNI EELAAECKS AGYPGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINMAGIARPDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHIIKINSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAYIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
I Signal peptide: 

Q amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
m 199-205 

uj 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 



AACTTCTAC ATG GGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
C CGT T GAG T GC T T GAAATGAG GAAC TG AGAAAAT TAAT T T C T C ATGT AT T T T T C TCAT T TAT T T A 
fa T TAAT T T T T AAC T GAT AG T T G T ACAT AT T T GGGG G T AC AT G T G ATAT T T G GAT AC AT G T AT ACAA 

O T AT AT AAT GAT C AAAT C AG GG TAAC T G G GAT AT C CAT C AC AT C AAAC AT T TAT T T T T TAT T C T T T 

W TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
Y[ I CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
m GCACCACAATGC CC AACTAAT T T T T G TAT T T T TAGT AGAGACGGGGT T T TGCCAT GT T GCCCAGG 

m CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
W GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 

ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T TCCATCCATGT TGCT GCAAAT GACAGGAT TTCGTTCT TAAT T T CAAT TAAAATAACCACACAT G 
GCAAAAA 
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FIGURE 13Q 



MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of* the protein: 
Signal peptide: 

amino acids 1-18 



Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 63-79 



f) N-myristoylation site. 

C*f amino acids 86-92 

fit 
y 
in 

CO 

w 

55 

o 

ifl 
S3 
W 
O 
fil 
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FIGURE 131 



TT CT GAAGT AACGGAAGCT ACCTT GT AT AAAG ACCT CAAC AC TGCT G ACCATGATCAGCGCAGC CTGG AG C 

ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 

TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 

GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 

TTGAAAAACTTGCTGAAAGTAGAAAGAATATACCTATACCACAACAGTTTAGATGAATTTCCTACCAACCT 

CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA 

AAAT TC CC TAT CTGGAAGAAT T AC AT T T AG ATGACAACT CT G TCTCTGC AGT T AGC AT AGAAGAGGGAGCA 

TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 

GCCC AGGACT AT AGAAGAACT ACGCT T GGATGAT AATCGC AT AT CC AC T AT T T CAT CACCATCTCTTC AAG 

GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT 

TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 

TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 

CT T ATCT AAGGCAGCT CT ATCG AC TGGAT ATGT CCAAT AAT AAC CT AAGT AAT T T ACCT C AGGGT ATCT T T 

GATGATTTGGACAATATAACACAACTGATTCTTCGCAACAATCCCTGGTATTGCGGGTGCAAGATGAAATG 

GGTACGTGACTGGT TACAATCACT ACCT GTGAAGGTCAACGTGCGTGGGC TCATGTGCCAAGCCCCAGAAA 

AGGT TC GTGGGAT GGCT AT T AAGGAT CT C AAT GC AG AACT GT TT G AT T GT AAGG AC AGTGGG ATTGT AAGC 

ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 

CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 

CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 

AC T G CT T TGAGAC T CAGCT GGCTT AAAC TGGGCC AT AGCC CGGC ATTT GGAT CT AT AACAGAAAC AAT T GT 

AACAGGGGAACGCAGT GAGT ACTT GGTCAC AGCCCT GGAGCCTGATTCACCCTATAAAGT AT GCATGGTTC 

CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 

CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 

TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 

T TCAT AGG AAT GGATC GC T CTTCT CAAG GAACTGTGC AT AT AGC AAAGGGAGGAGAAG AAAGG AT G ACT AT 

GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT 

AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 

AC AAAAAC AAT C AC AGT GAAAGCAGT AGT AACCG AAGCT AC AGAG ACAGT GGT ATTCCAGAC T C AG ATC AC 

TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 

GT 
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FIGURE 132 



MI 3 AAWS I FL I GTKI GLFLQVAPLSVMAKSCP S VCRCDAGFI Y CNDRFLT 5 1 PTG I PEDATTLYL 
QNNQ INNAG I PS DLKNLLKVER I YLYHNS LDE FPTNLPKYVKELHLQENNI RT I T YDSLSKI PYL 
EELHLDDNS VS AVS I EEGAFRDSNYLRLL FLSRNHLS T I PWGL PRT I EELRLDDNRI ST I S SPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAIIGGAVALVT IALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 



in ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

2.S 



Important features of" the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Ami da ti on site. 

amino acids 567-570 
Leucine zipper pattern, 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
. GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
M GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
^ TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
HI C T C CAC T ACC T C AAGC TGAGTGATC CC AA.G TACC TAAGAGAGT T CCAGCT GACC C TC CAGC CCGG 

|j| GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
4f| C CCAGGAC TCAT TCTCAGAGGAGAGAAGTGACGTGT GCCT GGTGCAGCTGCTGGGAACCGGGACG 

SO GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
^ AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 

GACCACTCCAACAGAGCCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC 
m AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 

4;? q 

O TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
Ill AAC AGCAGGAAGGAT GCT T CGGGGAGC CT GATGC TGAAGATGAAGAAT T AT CTAAAGC TAT T CAA 

W TAT CAGCAGC AT TT T T CGAGGAGAGT GAAGAGGCGAGAAAAACAAT T T C CAGAT TC TC GCT CT GT 

TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
, ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 



MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 135 

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGAT TTGATCCCGTTCGAGTGGATATCACTT CGAAAGGAAAAAT GAGAGCAAGATAT G 
i U T GAAT TACATCAAAAC ATCAG AGG T TGT CAGACT GC CC T AT CC T C T C CAAATGAAATC T T CAG G T 

Q CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
tl TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
f I C TGACAT GAGAC GGGAAATGGAGCAG T CAAT GAAT ATGCT GAAT TCC AACCATGAGT TGCC T GAT 

l\* GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
m TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 

"3"";? 

||| AAACAC GG C AAC AC T GGG T GG CAT C CAAGT C T T GG AAAAC C GT GT GAAGCAAC TAC T AT AAACT T 

* GAGTCAT C CC GACGT T GAT CTC T TACAAC TGT GTAT G T T 

J ; jf AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 

f*\ GAGG T CAATAT T GATGTCACTGAAT TAAT TACAG T GT CCT ATAGAAAAT GCCAT TAATAAAT TAT 

y 1 ATGAACTACTATACATTATGTATATTAATTAAAACATCTTAATCCAGAAATCAMAAAAAAAAAA 

Q AAAAAAAAAAAAAAA 

ffl 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGA& 
RVLVDGEEHVGFLKTDGSFWHDIPSGSYWEWSPAYR 
WRLPYPLQMKSSGPPSYFIKRESWGWTDFI^PMVMM^ 
QSMNMLNSNHE LPDVSE FMT RLFS SKSS GKS S S GS SKT GKSGAGKRR 

Important features of the protein: 
Signal sequence: 

Q amino acids 1-23 

ill 

^ a Transmembrane domain: 

Tl amino acids 161-182 

\l :} 
\%£ 

Id N-glycosylation site. 

* amino acids 184-187 

!~ Glycosaminoglycan attachment sites. 

s..;! amino acids 37-40, 236-239 

f|| cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 



N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGC TACGTT GGC T T TC T GGAAGGGG AGGCTATATGC G T CAAT TCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
TGCAAGCCAAAGAT GGAAACAT T GACATCAGAAT C T T AAGGAGGAC T GAGT CT TTGCAAGACAC A 
P AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
JJj AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
m- CCATCAAGAAGGACCTC CGGC TCT CT C ATGCC CACATGACAT GCC AT TGT GGGGAGGAAG CAAT G 

|i| AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 
a T TT GGGGGAAC TAGAC AT TC T TCT GCAAT GGAT GGAGGAGACAGAATAGGAGGAAAGTGAT GCT G 

CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
ACC ACC AT C T C T T TACT GT AC TAGT CTTGTGCTGGT CAC AG T G TAT C T TAT T TAT GC AT T AC T T G 
C T T CC T T GCATGAT T GT C TT TAT GC AT CCC CAATC T T AAT TGAGACC AT AC T T G T AT AAGAT T T T 
T G T AAT AT CTTTCTGC TAT T G GAT AT AT T TAT TAG T T AAT ATAT T TAT T TAT T T T T T GC T AT T T A 
A.TGT AT T TAT T T T T T TACT T GGACAT GAAAC T T TAAAAAAAT TC ACAGAT T ATAT T TATAACC T G 
AC T AGAGC AG G T GAT G TAT T T T TAT AC AG T AAAAAAAAAAAAC C T T G T AAAT T C T AGAAG AG T GG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAAT CCTACACGGCCAGCATGTAT TTCTACAAATAAAGT T TTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSYQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SIANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51/ 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
A.CCACC T C T C T T CAGT AAAGT TGT TAT T GT T C TGAT AGAT GCC T TGAGAGAT GAT TTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCT GAAGC AAAGCCACCTACAG T TAG TAT GCCT CG AATC AAGGCAT T GAT GACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
T AAGACAAGCAAAAGCAGC T GGAAAAAGAATAGT CT T T TAT GGAGAT GAAACC T GG GT TAAAT TA 
|, K : : TTCCCAAAGCATTTTGTGGAATATGATGGAACAACCTCATTTTTCGTGTCAGATTACACAGAGGT 
Z GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC 
2* TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
U AAGC T GAGCGAGAT GGACAGCGTGC TGAT GAAGATC CACACC T CACT GC AGTCGAAGGAGAGAGA 

Cfl GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
Ui GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
in CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
%i ACT T GGCTTACCGAT T C CAAAAGACAGT GTAGGGAGCCT CC TAT TCCCAG T TGT GGAAGGAAGAC 

? fi ■ C AAT GAGAGAG C AGT T G AGAT T T T T ACAT T T GAAT ACAG T GC AGC T T AGT AAAC TG T T GC AAG AG 

W AAT G T G C C GT C AT AT GAAAAAGAT CCTGGGTTT G AGCAG T T T AAAAT G T CAGAAAGAT T G CAT GG 

■g GAAC T GGAT CAGAC T GT ACT TGGAGGAAAAGCATTCAGAAGT CC TAT TCAACC TGGGCTC CAAGG 

f-p TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
?!? TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
jl 1 CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
C3 CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
lyl GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
p AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
IHi TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
Iy CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 

TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C TGCACACAG TATGTAG T TACCAAAAGAATAAACGGCT^AT AAT T GAGAAAAAAAA 
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FIGURE ldo 

MRLG S G T FAT CC VAI E VLG I AVFLRG FFPAPVRS SARAEHGAE PPAPEP SAGAS SNWT TL PP PL F 
SKWI VL I DALRDDFVFGSKGVKFMP YT T YLVEKGAS HS FVAEAKPP TVTMPR I KALMTGS LPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLL VLCGDHGMSETG SHGAS S TEEVNT PL I LI S SAFERKPGD I RHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites . 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site . 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTG ATG TTACTGCTGCTGTTGGAGTACAACT 
T CC CTATAGAAAACAAC TGCC AGCACCTTAAGACCACTCACACCT TCAGAGTGAAGAACT TAAAC 
CCGAAGAAAT TCAGCAT TCAT GACCAGGATCACAAAGTACTGGTCCT GGACTCTGGGAATCTCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T G T GACAAGGAT AAAGGACAAAGTCAT CCATCCC T T CAG CTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 

w 

fl ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 

ffl GT T GGGGTGACAGATAAAT T T GAGAACAGGAAACACAT TGAAT T T T CAT T T CAACCAGT T TGCAA 

\ „ n 

fl AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT T T GAAC TAAT T GTATAAAAACACCAAAC CTGC TC ACT 

wl 
ill 

fil 
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FIGURE 14.2 



MLLLLLE YNFPI ENNCQHLKTTHT FRVKNLNPKKFS IHDQDHKVLVLDSGNLI AVPDKNY IRPE I 
FFALASSLS SASAEKGS P I LLGVSKGE FCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEHSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 

amino acids 33-36 

N-myristoylation site. 



%J amino acids 50-55, 87-92 

& 
5; ; S 

Interleukin-1 

m amino acids 37-182 



i n 
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FIGURE 143 

CTAGAGAG T ATAGGGCAGAAG GAT GGC AGAT GAG T GAC TC CAC AT CC AGAGC T GCC T CCC T T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
C AGTGGAGT T T GCT G T C CACACAT T CAACCAACAGAGC AAGGAC T AC TA.T GCCTACAGACT GGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
GGGGAGAACTAGGTGTGGGAAATTTGAAGACGACATTGACAACTGCCATTTCCAAGAAAGCACAG 
AGC TGAAC AATAC T TT CACC T GCT T CT T CACCATCAGC ACCAGGC CC TGGATGAC TCAGT T CAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATCT T GAGCC TAAT CATGT AGT GT AGAT CAT T AAAC AT CAGCAT T T T AAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 144 

MLGLPWKGGL S WALLL L LLGS Q I LL I YAWHFHEQRDCDEHNVMARYLPATVE FAVHT FNQQSKDY 
YAYRLGHILNS WKEQVESKTVFSMELLLGRTRCGKFEDDI DNCHFQES TELNNTFTCFFT I STRP 
WMTQFSLLNKTCLEGFH 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 



■ : 
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FIGURE 14* 

C T G T G C AGC T C G AGG C T CC AG AGG C AC AC T C C AG AGAG AG C CAAG G T T C T GAC G C GATGAG G AAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GAC T T C GGAGCC GAG GGCAACAGG T AC T AC GAGGCC AAC T AC T GGCAG T TC CCC GAT G GC AT CC A 
CTACAACGGC TGCT CT GAGGC TAATGT GACCAAGGAGGCATT TGTCACCGGCTGCATCAATGCC A 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
C CGT CAC T CCAT T C T CAGCCC CT AGCAGAGCG T C TGGCACAC T AGAT T AGT AGTAAATGCT T GAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
GTGTAT T CCC G T T T TAC AGAT AAGGAAACT GAGGCC CAGAGAG CTGAAG TACT G CACC CAG CATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CTCTAATGAAAT TGTGAAAGC TCCATGTT TAGAAATAAAT GAAAACACCTGA 



FIGURE 146 

MRKHLS WWWLATVCMLL FSHLSAVQTRG I KHRI KWNRKALP S TAQI TE AQVAENRPGAFI KQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Ami da ti on site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGT TTGT TT TATGAGTATAT TGCATGATGCTGAGG 
TTTGGGGT 
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FIGURE izl8 



MFRSSLLFWPPLCLLSLFLLIL I SS I YSESCKLS I FHFACQWGRSLSLS FY FLKFQLSDSGGTCE 
GLFYEYIA 



Important features of the protein : 
Signal peptide: 

amino acids 1-25 

H 

O N-myristoylation site. 

— 1 amino acids 62-68 

W 

W 
If! 

m 

M 

O 
If! 
O 
W 
O 

ru 
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FIGURE 14Q 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCT ACATAATAGAGAATTAAGTGCAGAAAGAC CT T TGAAT GAACAGATTGC TGAA 
GCAGAAGAAGACAAG AT TAAAAAAAGAT ATCC T CCAGAAAACAAGCCAG GT CAGAGCAACTAT T CTTT 
M TGT T GATAACTTGAACCTGC T AAAGGCAATAACAGAAAAGGAAAkAATT GAGAAAGAAAGACAAT CTA 

W TAAGAAGCTCCCCACT TGAT AATAAGTTGAATGT GGAAGATGTT GAT T CAACCAAGAATCGAAAACT G 

Sf ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA 
?l T C AACT AGACGGGACT CCT T TAACCGCTGAAGACATT GT CCATAAAAT C GCTGCCAGGAT T TAT GAAG 

2 f 1 AAAAT GACAGAG CCG T GTTT GACAAGAT TGTTT CTAAACTACT TAAT CTCGGCCT TATC ACAGAAAGC 

fg CAAGC ACATACACTGGAAGATGAAGTAGCAGAGGTT T TACAAAAAT T AATC T CAAAGGAAGCCAACAA 

W TTAT GAG GAGGAT CCCAATAAGCCCACAAGCTGGACT GAGAATCAGGCT GGAAAAATACCAGAGAAAG 

s TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
TTAACCT TGACAAATGGCTT GGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTTGAGGAACTCCA 
%l ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 
|Ts CAC TGAT TACT AT CAT GAAAACAC T GATT GAC T TT GT GAAGAT GAT GGT GAAATAT GGAACAAT ATCT 

Q CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
f|J AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 

CAGACAGTACCAAGGAAGAAGCAGCTAAGAT GGAAAAGGAAT AT GGAAGC T T GAAGGATT CCACAAAA 
GAT GATAACTCCAACCCAGGAGGAAAGACAGAT GAACCCAAAGGAAAAACAGAAG CCTAT T TGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGAT GAGAGACTT CATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
C TGTTT CAGAAAACATAAT ATAGC T TAAAACAC T TCT AATTCT GT GAT TAAAATT T TTT GACC CAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE 1*0 



HI 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTPCNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 



H7 ■'■ 

|,;; N-glycosylation sites: 

111 amino acids 68-71, 346-349, 350-353 



Casein kinase II phosphorylation site: 

amino acids 70-73, 32-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAG TGCC TGTCAT GT GGGGTGGGGCAGGAGC CGAC TC TAACACTAGAGCCAGT GAACAT CAT GG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCGAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACT.TCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
- : GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
|^ TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
O CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
Q CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
ffl TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
JJf GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
41 1 TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
fl| GAAATGGCTCGAGCTCAGAAGATAAAAGATAAGTAGGGTATGCTGATCCTCTTTTAAAAACCCAA 
ygj GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 
*" AGAC C TAC T TAC AAAG T GG CAT AT AT T G CAAT T TAT T T T AAT T AAAAGATACC TAT T TAT AT AT T 

* T CT T T ATAGAAAAAAGTCTGGAAGAG T T TACT T CAAT T G TAGC AATGTCAGGGT GG T GGC AGTAT 

Q AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
HI AT T CCT TG TATAAAAATAAGAAAAGAAAT TAATC T TGAGGTAAGCAGAGCAGACATCATC TC TGA 

TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
^'h T TGT AGT AGT GAT CAGGAAAC AGAT C T CAGCAAAGCCAC T GAGGAGGAGGCTGT GC T GAGT T TGT 

GTGGCTGGAATCTCTGGGTAAGGAACTT71AAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
Q GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
f|| ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGATGAAGGCAGAGATCGGAGTTTTGCAGCCACAAGCTAAGAAACACCAAGGATTGTGGCAACC 
ATCAGAAGCTTG GAAGAGGCAAAGAAGAAT TC T TCCCT AGAGGCT T T AGAGGGAT AACG GCT CT G 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAATTGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCAGAG CCACAT TAGT GAACCTAGAAGCAGAGACT C T GTGAGATAAT CGAT GTTTGTTGTTTT 
AAG T T GC T C AG TTTTGGTC T AAC T T G T TAT GC AG CAAT AGAT AAAT AAT AT G CAGAGAAAGAG 
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FIGURE 1*2 



irl 



m 



5?' 

2 ft 



MVL SGALC FRMKDS ALKVL YLHNNQLLAGGLHAGKV I KGE E I S WPNRWLDAS L S P V I LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 



111 
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FIGURE 153 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
T T C CAGCAGCCCTATATCACCAACC GCACC T TCAT GCT GGC T AAG GAG GCT AGC T T GGCT GATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGT GATGACCTGCATATCCAGAGGAATGT GCAAAAGCTGAAGGACACAGTGAAAAAGC 
T T GG AGAGAGTGG AGAGATCAAAG C AAT TGGAGAAC TGG AT T TGC TGT T TAT GT CT C T GAGAAAT 
G CC T GCAT T TGACCAGAGCAAAGCT GAAAAATGAAT AACTAACCCCC TT T C CC TGCTAGAAATAA 
C AAT T AGATG CCCCAAAGCGAT T T T T T T TAACCAAAAGGAAGAT GGGAAGC CAAAC T CCAT CAT G 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AG AC C AGAAG G T AG AC T T T C T AAGC AT AG AT A T T TAT T G ATAAC AT T T CAT T G T AAC TGGTGTTC 
T AT ACACAGAAAAC AAT T TAT T T T T T AAAT AAT TGTCTTTTTC C ATAAAAAAGAT T AC T T T C CAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
T G T AT T TAT TAT TAT T AT AAGAC T G CAT T T TAT T TAT AT CAT T T TAT T AAT AT G GAT T TAT T TAT 
AGAAACATCAT T CGAT AT TGC TACT TGAGTGTAAGGC TAATAT T GAT AT T T AT GACAATAAT TAT 
AGAGCTATAACATGT TTAT T T GACC TCAATAAACAC T T GGATATCCC 
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FIGURE 1*4. 



MAALQKS VS S FLMGTLAT S CLLLLALLVQGGAAAP I S S HCRLDKSN FQQPY I TNRT FMLAKEAS L 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCH I EGDDLH I QRNVQKLKDT VKKLGESGE I KAI GE LDLL FMS LRNAC I 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 



U N-glycosylation sites. 

J"! amino acids 54-58, 68-72, 97-101 

fft N-myristoylation sites. 

W amino acids 14-20, 32-88 



Prokaryotic membrane lipoprotein lipid attachment site* 

amino acids 10-21 
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FIGURE 155 



U.. 



W 



HI 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCC TGAGGGC T G T GCTGAGAGGGAGAGGAGCAGAGAT GC TGC TGAGGGTGGAGGGAGGC CAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACAC CT C T GAGGAGC TGCTGAGGTGGAGCAC TGTGCCTGT GCCT C CC CTAGAGCCT G C TAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCGACA 
AG'GGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGC TAG CCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCAC TT GCCAT GAAGGGC CAGGAT GCCCAGAT GCT T GGC CCC T GT GAAGT GC TGTC TGGAGCAG 
C AG GAT C C C G G G AC AG GAT GGGGGGCTTTGGG GAAAAC C T G C AC T T C T G C ACAT T T T G AAAAG AG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T T GACAT G GAT GAT T C T GAGG AG G AAGC T G T TAT T G AAT G T AT AG AGAT T TAT C CAAAT AAAT AT 
CTTTATTTAAAAATGAAAAA 
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FIGURE is6 



MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 



||| Tyrosine kinase phosphorylation site. 

W amino acids 127-135 



N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 

CCGGCGATCTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGACTATTCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGC T GAACACAG T CT AT T TGAT T GGGGCCCATAAT AT TC C TAAT GC AAAT AT GAAT GAAGATGG 
C CCT TCCATG TCTGT GAATT TCACCTCACCAGGCTGCCTAGACCACATAAT GAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GT AGAAGT GAAC T T CACAACC AC T C CC C T GGGAAAC AG AT ACAT GGC T C T TATC CAAC AC AGCAC 
TAT CATCGGGT T T T CT CAGGT GT T T GAGCCACACCAGAAGAAACAAACGCGAGC TT CAGT GGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
T CACAC AAT T T GT T AC T T C AC T GAAT T T C T T C AAAACC AT T GC AGAAG T GAGG T CAT C C T T GAAA 
AG T G GC AGAAAAAGAAAAT AG CAGAGAT GGGT C C AG T GCAG TGGCTTGC CACT C AAAAGAAG G CA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATC TAAGAAGCCAGAT TCAT CT GCACAAAT ACGT GG T GGT CTAC T T TAGAGAGAT TGAT ACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 



FIGURE 1*8 
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:: v'-'i 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSWETSPGCLDHIMKYKKKCVKAGSLWDPNITACKE<NEETVEVNFTTTPLGNRYMALIQH 
STIIGFSQVFEPHQKKQTRASVVIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIECKTS FSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWIATQKKAADBC^'FLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYWVYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 
amino acids 1-14 

Transmembrane domain : 

amino acids 290-309 



til N-glycosylation sites. 

W amino acids 67 - 71, 103 - 107 r 156 - 160, 183 - 187, 197 - 201 and 283 

!!l - 287 

Pi I 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AG C CAC GAG C G C AAC AT GA CAG T GAAG ACCC T GCAT GG C C C AGC C AT GG T C AAG T AC TTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGT TT CCATGT CACG TAACATCGAGAGCCGCTCCACCTCCCCCT G 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
ACT CAGCTGAAGAAG 



LI 
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FIGURE 160 



MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPPCVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC TGAAGAT GAGGAAAAG T T T GGAGGAGCAGC TGAC TCAGGGGTGGAGGAGC CT AGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
H TGGAGGTGGAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
|*| TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
f% GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
|f ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
fl GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
W CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
Iff CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
f|| CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
r§ GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
W TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
m AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
O CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
|fl GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
'f4 CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
Pi CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
CI CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
p| CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 

CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
C GGG AC T TAA AT AAAGGC AGACG CTGTTTTTC T AAAAAAA 
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FIGURE 162 



MP VPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPG PVIiA.PT HLQT EL V 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDN\''HLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWAC PMDKY IH KRWALVWLACLL FAAAL SL I LLLKKDHAKGWLRLLKQDVRSGAAARG 
H RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
Q GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDS Y FHPPGTPAPGRGVGPGAGPGAGDGT 

vpa Signal sequence: 

fi\ amino acids 1-20 

hi 

3 Transmembrane domain. 

O amino acids 453-475 

\t% 
%f % 

f*k 

N-glycosylation sites . 

&f amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 

%f s 334-337 f 357-360, 391-394 

'zz 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 

AAACATC CTGACGT GGGACAGCGGGCCAGAGGGCAC CG CAGACACGGTC TACAGCAT CGAG TATA 

AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 

AAC C TGACGG T GGAGACGGGC AACC TCACGGAGCTC TAC TAT GCCAGGGT CACC GC T 

GT CAGT GCGGGAGGCCGGTCAGC CACCAAGATGAC T GACAGGT TCAGCT C T CT GCAGC ACACTAC 

C C T CAAGCCACC TGAT G T GAC C T GT AT C T CC AAAG T GAG AT CG AT T C AGAT G AT T G T T CAT CC T A 

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

TTCTACCACTTAGAGCTCCAGGTCAACCGCACCTACCAAATGCACCTTGGAGGGAAGCAGAGAGA 

ATAT GAGTTCT TCGGCC TGACCCCT GACACAGAGT T CCT TGGCACCATCATGAT TTGCGT TCCCA 

C CT GGGCCAAG GAGAGT GCCC CCTACAT GTGC CGAGTGAAGACAC T GCCAGACC GGACAT GGAC C 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CAC CT TAGGC CT AAAGG T CAGC T TCAGAAAGAGCCACCAGC TGGAAGCT GCAT GT TAGGT G GCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

C CC T GG GGAT T T G C ACAGACAGAAC AT C T GACCCAAAT G T GC T ACAC AGT G GGG AGGAAGGG AC A 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

T GC TGGAGTCCCT T G T GTGTCCCAAGGAT GAAGCCAAGAGCCCAGCC CC T GAGACCT CAGAC CT G 

GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 

CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 

GCC T GGAAAAGAAC CAGGCCT GGAAAAGAACCAGAAGGAGGCT GGGCAGAACCAGAACAACCTGC 

ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

T TCCCAGC CAGGGC AAC T GCC T GAC G T TGCACGAT T T CAGC TT CAT T CCT CTGATAGAACAAAGC 

GAAATGCAGGT C CACCAGGGAGGGAGACACACAAGCC TT T TCT GCAGGCAGGAG T T T CAGAC CC T 

ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 

TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 

CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 

ATC T GTAATGTGGGGAT CATAACAC C T ACC TCATGGAGT T GTGGTGAAGAT GAAATGAAGTCAT G 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 

AAAAAAAA 
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FIGURE 164 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQWRTYQMHLGGKQREYE FFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLT FQPLRFI QEHVL I PVFDLS GP S S LAQPVQ YS Q I RVS G PRE PAGAPQRHS LSE I T 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 



Signal sequence. 



||| amino acids 1-17 

■ 

Transmembrane domain. 

*■« amino acids 233-250 



|s| N-glycosylation sites. 

Q amino acids 80-83, 87-90, 172-175 



N-my r i s t oy 1 a t i on sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 



AppJD= 10063 583 



Page 316 of 320 



FIGURE 16* 



TGGCCTACTGGAAAZ\AAAAAAAAAAAA?W\AAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGT G T T GAAC ACAGT T T T GGATAT T T T CCAAAAGAT T T GAT CAAGGT ACT T CATAAAT AC AC GGA 
AGAAGAGCTACATAT TCCAGCAGAT GAGACAGACTT TGTCTGCTTTGAAGGAGGAAGAGATGAT T 
T T AATAG T T AT AAT G T AGAAG AG C T T T TAG GAT C T T T G GAAC T GGAG GAC T C T G T AC C T G AAG AG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
C AGAGAGCACCGAGGGGCT GC AGGGAC AGCCC T CAGC T CAGGAGAGC CACC CT C ACACGAGC GGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
| £ | ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGC^ATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
« AGACAGAT GC T T ACAAAGTCC T GAAAACAGAAATGAGT CAGAGAGGAAGTGGACAG T GCGT TAT T 

CAT TACAGCAAAGGAT TTCGT T GGCAT CAAAAT C TAAGT T TGT T T TACAAAGAT TGT T T T TAGT A 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 



ill 
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FIGURE 166 



Q 



MAAAPGLLFWLWLGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKTkEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLBCVPGSESRTGNSSPASVER 
EKT DAYKVLKTEMS QRGSGQCVI HYSKG FRWHQNLS L FYKDCF 

Important features of* the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 



C3 amino acids 67-76 

CI! 



N-myristoylation sites* 

amino acids 205-211, 225-231, 277-283 



Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCGACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGC'GGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
G T C C AG C C GAG T GAG C T T T C AG GAG G C AC GCCTGGCTTGT GAG AG T GAG G GAG GAG TCCTCCTCA 
GCCT TGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCAT GTTGCAAAACCTGACAAAACCCGGG 
f' A: ACAGGGAT T T C T GATGG T GAT T T C T GGATAGGGC T T TGGAGGAAT GGAGAT GGGCAAACAT CT GG 

Q TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAAGTGGTACACAG 
f4 ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
m CTT GGGGGTGCCTACCT TTACCAGTGGAATGATGACAGGT GTAACATGAAGCACAAT TATATT TG 

f I CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCC T TATCTTACAAATCAACCAG 

W GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
If! ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
fIJ GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
C CAG AAAAGAAAG T G GCATGGAAG T ATAA T AAC T CAT T G AC T T GG T T CC AG AAT T T T G T AAT T C T 
GGATCTGTATAAGGAATGGCATCAGAAC^TAGCTTGGAATGGCTTGAAATC^O^AAGGATCTGC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
T T TAAAGAATAT G C T GTGC TAATAAT GGAGT GAG ACAT G C T TAT T T T GC TAAAGGAT GCACCCAA 
AC T TCAAAC T TC AAG CAAAT G AAAT G GACAATGCAGAT AAAGT TGT TAT CAAC ACG T CGG GAG T A 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
TGTATAT T GTAT TGAAAT T TACAG T G TGCAAAAGTAT T T T ACC T TT GCATAAGT GT T T GAT AAAA 
AT G AAC T G T T C T AAT AT T TAT T T T TAT GGC AT C T CAT T T T T C AAT AC AT GCTCTTTT GAT T AAAG 
AAACT TAT TAG T GT T GT CAAC TGAAT TCACACACACACAAATATAGT ACCATAGAAAAAGT T TGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AAT AAGAAGC TAT T T CAT T AAG TGT G AT AT AAACC T CC T C AAACAT T T T AC T TAGAG G CAAG GAT 
T G T C T AAT T T C AAT T G T GC AAGACAT GT GCC T TAT AAT TAT T T T TAG C T T AAAAT T AAAC AG AT T 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
T GACATACACAATATAAATCATATGTCTTCACACGT TGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGC AC AGCACACAGACAT T T TAGGAAGGAAAGGAAC TACGAAAT CGTGTGAAAATGGG T TGG 
AACCCAT C AGTGAT CGCATAT T CAT T GAT GAGGGT T TGC T T GAGATAGAAAAT GGTGGCT CC T T T 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 



If! 

N-glycosylation sites. 



amino acids 86-89, 255-258 

cAMP- and c©4P-dependent protein kinase phosphorylation site. 

amino acids 266-269 



Q N-myristoylation sites. 

111 amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 

145, 212-217 
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